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, ENZYME KINETICS: INHIBITION EQUATIONS DERIVED 
FROM THE INELASTIC COLLISION HYPOTHESIS* 


Joun Z. HEARON AND RoBERT KATZMANt 


Oak Ripce NatTIonAt Laporatory, OAK RipGE, TENNESSEE AND 
HARVARD MEpIcAL SCHOOL, Boston, MASSACHUSETTS 


The inelastic collision hypothesis of enzyme action has been proposed by G. Medwedew. 
) This theory has been here extended to the cases of competitive and non-competitive inhibitors, 
which form enzyme-inhibitor complexes, and to the cases of competitive and non-competitive 
substrates. The resulting equations are discussed and contrasted to those derived from the 
_ classical enzyme-substrate hypothesis. The original formulation of Medwedew neglects the 
presence of the enzyme-substrate complex although the general theory admits the formation 
of this complex. The formulation has been revised and extended to include complex formation. 
The resulting equation is discussed in terms of the usual criteria used to evaluate the Michaelis- 
Menten-Briggs-Haldane equation. 


J+ ue ee i, a 


I. Introduction. There is no aspect of the general theory of enzyme ac- 
_ tion which is better known and more widely accepted than the hypothesis 
4 that one or more enzyme-substrate complexes (hereafter referred to as 
_ES-complex) are formed during the course of reaction. Certainly the char- 
A acteristic features of all current kinetic formulations of enzyme systems 
are direct consequences of this basic assumption. 
G. Medwedew (1937a, b, c) has rejected the ES-complex hypothesis 
_ and erected a rather extensive theory of enzyme action upon the concept 
that the essential mode of enzyme-substrate interaction consists of in- 
elastic collisions. It is a remarkable fact that this theory has received, 
especially at the time it was suggested, so little attention. With the excep- 
tion of one of us (Katzman, 1950) only one author (Wilson, 1949) has 
devoted any discussion to the Medwedew theory. Earlier writers (e.g., 
Haldane, 1930) felt it necessary to discuss the evidence for and against the 
_ES-complex hypothesis. Today the evidence is seldom if ever (cf. Katz- 
man, 1950) marshalled and discussed. In fact those authorities in enzymol- 


ee 


* Taken in part from a thesis submitted by Robert Katzman to the faculty of the Depart- 
ment of Physiology, University of Chicago, in partial fulfillment of the requirements for the 
Degree of Master of Science. 

: { Present address: Neurological Institute, Columbia-Presbyterian Medical Center, New 
York, New York. 


ra | 


260 JOHN Z. HEARON AND ROBERT KATZMAN 


ogy who discuss the point at all dispose of it in what must be regarded as a 
highly cavalier fashion (e.g., Sumner and Somers, 1947, p. 20; Sumner and 
Myrback, 1950, pp. 11-12). To our knowledge there has appeared no logi- 
cal analysis of Medwedew’s theory and in particular no arguments which 
suggest that the theory is untenable. On the other hand, there has been 
made the remarkable and hopeful suggestion* (Northrop ef al., 1948, p. 
48) that if Medwedew’s theory proves to be of general application “the 
kinetics of enzyme reactions will at last be explainable in terms of chemical 
theory.” 

In general much importance is attached to the fact that rate-substrate 
relations derived on the basis of the ES-complex hypothesis satisfactorily 
describe experimental findings. But Medwedew has shown that his theory 
predicts a rate-substrate relation which fits the experimental data, for the 
invertase-sucrose system, as well as does the classical Michaelis-Menten 
equation. Katzman (1950), in a study of ten enzyme systems, has shown 
that the Medwedew equation gives a better fit in three cases while the 
Michaelis-Menten equation gives a better fit in the remaining seven 
cases.t It is possible that refined statistical techniques would lead to 
slightly different results but it is clear that no convincing discrimination 
between the two theories can be obtained in this manner. 

This is a valuable object lesson in formal kinetics (cf. discussion by 
Wilson, 1949) and it is important to reemphasize the following well-known 
facts: Rate-substrate relations are but one set of consequences of a given 
hypothesis of enzyme action. If two rival theories give equally credible 
results in this respect, it is necessary to deduce, and compare, further 
consequences in order to adequately compare the two theories. 

It is the purpose of this paper to exhibit and comment upon the inhibi- 
tion equations derived from Medwedew’s hypothesis and to give analytic 
form to Medwedew’s statements regarding the role of the ES-complex 
within the framework of his theory. 

II. Medwedew’s theory. For purposes of clarity and convenience we 
here state the relevant features of Medwedew’s theory: The assumption is 
made that the rate of reaction, v, is proportional to the concentration of 
enzyme molecules, P, which are engaged in inelastic collisions. The rate of 


* It is our opinion that this suggestion is based on a lack of understanding of the difference 
between Medwedew’s theory and the classical gas-kinetic collision calculation and between 
the latter and absolute rate theory. 


} These statements are based on the relative magnitudes of the sums of squared deviations. 
The least-square parameters were obtained from linear plots and the sums of squared devia- 
tions from the actual rate-substrate plots. 
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change of P with respect to the substrate concentration, S, is asserted to 
be proportional to the concentration, ¢, of enzyme molecules, which are 
available for inelastic collision and are not so engaged. That is, 

ney 

a5 OP: (1) 
where 6 is a constant, P may depend upon the concentration of an in- 
hibitor, and yg always depends upon P and may depend explicitly upon S 
and the concentration of an inhibitor. 

In defining ¢ above we have paraphrased Medwedew’s definition. It is 
important that it be made clear here that Medwedew admits the forma- 
tion of the ES-complex but (in direct contrast to the classical ES-complex 
hypothesis) regards those enzyme molecules which are so combined with 
substrate as inactive or, as we have phrased it, unavailable for (effective) 
inelastic collision. In other words, it is supposed that the formation of 
ES-complex, instead of leading to reaction as in the classical theory, serves 
to tie up or eliminate enzyme molecules which would otherwise participate 
in inelastic impacts which lead to reaction. 

Now it is clear that if ES-complex is formed the concentration of this 
complex can be prescribed, by means of the law of mass action or some 
other relation, as a function of S. It will then be true that, as stated 
above, ¢ will not only depend on S through its dependence on P but will 
also be an explicit function of S. Medwedew also states that the action of 
inhibitors can probably be explained on the basis of the formation of 
enzyme-inhibitor complexes but he is somewhat less than clear as to 
whether this is the only mode of action (admissible within the context of 
his theory) and does not discuss in detail the manner in which the concen- 
tration of ES-complex is to be incorporated into the function ¢ in (1). 

In the absence of an inhibitor and under the assumption that the con- 
centration of ES-complex is negligible, it is evident that ¢ = Eo — P 
where E, is the total enzyme concentration. With the conditions P = 0, 


S = 0, the solution of (1) is 
Patio lt = ex), (2) 
and the reaction rate is given by 
y= gP= V,(1—€) (3) 


where V,, = gE is the maximum attainable rate, asymptotically ap- 
proached as S — , and g is a constant of proportionality. 
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It is equation (3) which Medwedew and Katzman have compared to the 
Michaelis-Menten equation and to which we have referred in the introduc- 
tion. It is worthy of mention here that Medwedew’s theory is completely 
formal in that no statement regarding the actual nature of the propor- 
tionality constants 6 and g is given. In particular, although Medwedew 
devotes much discussion to the temperature dependence of v and right- 
fully criticizes the conventional use of the quantity Qio, nowhere is the 
temperature dependence of v made referable to the temperature depend- 
ence of the parameters 0 and g. 

The formal ‘“‘correspondence’’ between (3) and the Michaelis-Menten 
equation, 

V mS 


— 2 + 
| (4) 


Uv 
can be seen as follows (Medwedew, 1937c; Wilson, 1949; Katzman, 1950). 
The quantity V,, has the same physical significance (maximum attainable 
rate) in (3) and (4). In both cases V ,, is proportional to Eo. We denote by 
Siy2 that value of S for which » = 3V,,. Then from (4) 


Si/2 a KG ; (5) 
while from (3) 


Sie= 5 In2. (6) 
Equations (5) and (6) relate the parameters K and 6 to the experimentally 
observable quantity S12. 

In (4), Vm = kEo where k is the rate constant for the break-down of the 
ES-complex. Thus, in the range where thermal denaturation is negligible, 
the temperature dependence of V , is simply that of k as given by absolute 
rate theory or the simple Arrhenius equation. If (4) is derived on an 
equilibrium basis, then K is the dissociation constant, Ki, of the ES- 
complex. If (4) is derived on a steady-state basis, K = K, + (k)/(A:), 
where &, is the rate constant for the formation of the complex. On either 
basis the temperature dependencies of V,, and K are known and the tem- 
perature dependence of v can be expressed in these terms (Hearon, 1952). 

III. Competitive and non-competitive inhibition. In terms of the Medwe- 
dew theory, those cases in which an inhibitor can combine only with 
enzyme molecules which are neither engaged in collision nor already com- 
bined with an inhibitor molecule will be said to be competitive. Those cases 
in which an inhibitor can combine with an enzyme molecule whether or 
not it is engaged in collision will be said to be non-competitive. Assume that 
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an enzyme-inhibitor complex is actually formed and denote its concentra- 
tion by x; then ¢ is given as 


g=EK,—x—P. (7) 


In the competitive case the value of « is determined by the mass-action 
expression 


t-K.=ol, (8) 


where J is the inhibitor concentration and K, is the dissociation constant 
for the competitively formed complex. From (7) and (8) 


e=> (BoP), (9) 


where f. = 1+ I/K.. From (9) and (1) it is obvious, in view of (3), that 


Hp Vege Li te sey: (10) 


where 2, is the competitively inhibited rate. From (3) and (10) it is evident 
that the effect of the presence of a competitive inhibitor is to replace the 
parameter b by the quantity 6/f.. Defining S‘,, as the value of S for which 
Ve = $V m, it is seen that 


Si/2 


> (11) 
cas =fe.71. 


The form of the Michaelis-Menten equation which corresponds to (10) is 
V os. 


oe eae 12 
TKS yh 
and (11) likewise results from (12) and (4). 
In the non-competitive case, x is determined by 
x-K,=(e+P)I, (13) 


where K, is the non-competitive dissociation constant. From (7) and (13) 


r= Sag gee (14) 


and from (14) and (1) 
| ae 


7 0 td (15) 


yy = 


where f, = 1 + J/K, and 2, is the non- -competitively inhibited rate. Com- 
parison of (3) and (15) shows that the presence of a non-competitive in- 
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hibitor has the effect of replacing the maximum attainable rate by Vniee 
and that for all S > 0, 


are, Oe (16) 


qos Sales (17) 


and (16) also results from (17) and (4). 

Therefore, provided that inhibition is the result of the formation of an 
enzyme-inhibitor complex and the above definitions for competitive and 
non-competitive are accepted, inhibition studies involving these types of 
inhibitors can reveal no differences between the Medwedew theory and the 
classical theory which are not inherent in (3) and (4). 

IV. Inhibition by collision. In discussing competitive and non-competi- 
tive inhibition it has been assumed that the inhibitor, but not the sub- 
strate, forms a complex with the enzyme. There is a third type of inhibi- 
tion which is compatible with the general concepts of the Medwedew the- 
ory. Moreover, logical consistency demands that this type of inhibition be 
examined especially in connection with competitive substrates. 

Assume that the mode of enzyme-inhibitor interaction is identical with 
that of enzyme-substrate interaction. Assume further that an enzyme 
molecule engaged in substrate collision cannot engage in inhibitor collision 
and vice versa. Then if by Y(S, J) we denote the concentration of enzyme 
engaged in inhibitor collisions, (1) becomes 


<cP(S, 1) =b{E—P(S, 1) —¥(S, D1, (18) 


and, similarly, the equation for Y is 


aV(S, I) 6 Ey P15; 1) SY (Spay Ig (19) 


where ¢ is a proportionality constant analogous to 6. The conditions under 
which we seek solutions are 


P(0, 1) =0, (20) 
Y (S,0) =0. (21) 
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Let the Laplace transform with respect to J and the inverse transform be 
defined in the usual way: 


L{F(S, I) } =f (S, ») = [oF (Si neT (22) 

SAS A) HF (OS, 1: (23) 

Then the transformed equations corresponding to (18), (19), and (20) are 
50'S, ») = [= pis, ») —9(S,»)], (24) 

Ay (S, 2) = [2-9 (S,») -y(5, 0], (25) 

p(0,r») =0, (26) 


where and y are the transforms of P and Y respectively and (21) has 
been used in writing (25). Substitution of y from (25) into (24) gives 


d 
oF = 7 a(n) — 5B (n) 8, (27) 
where 
BO) =>2. (28) 
The solution of (27), with (26), is 
p (S, ») =22(1 = e- #005) (29) 


the solution of (18) — (21) for P(S, J) can be formally indicated as 
P(S, 1) =Ey—Eye®I-1{ f (S, 2) } (30) 
and is completed by a the function, F(.S; ), whose transform is 


1 C 
k/cot+y = RictN tes k/e+h 31 
ee ay ne 


where k = bcS. [In writing the identity in (31) a device employed by 
Sangren and Sheppard (1953) has been used.] By known properties of the 
transform (Churchill, 1944; translation theorem, Section 7; convolution 
theorem, Section 14; transform no. 81, Appendix III) 


— I 
F(S, D = et 90(2 VED) +e f e-*So(2 VRE) GE, — (32) 
where $(é) = Jo(it) is the Bessel function of the first kind, zero order, and 


imaginary argument. With (32), (30) becomes 
PU) Sy [1 e SEAS, D1. (33) 
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It is intuitively obvious and readily verified that P(S, 0), from (33), is 
identical with (2). Further, it can be seen, from the asymptotic properties 
of So(#) (Whittaker and Watson, 1946, p. 373), that 


P(«, I) =£o, (34) 

and it can be shown that 
P(S, ©) =0. (35) 
According to (32) and (33), it is true for fixed J and all finite S that 
PUATY GPs, uy, (36) 


but as S increases without limit, P(S, 7) and P(S, 0) approach, according 
to (2) and (34), the same limit, Ho. These are precisely the well-known 
properties of competitive inhibition: For given J, the inhibited curve, 
v; = gP(S, I), lies below the uninhibited curve, v = gP(S, 0), but the two 
curves approach the same limiting rate, Vm = gEo. 

Substitution of P(S, I) from (33) into (18) shows that 


a bE, eS e-4 Fq (2 V/bS cL), (37) 
from which it follows that 
d Yi = ea dv 
> ie % Ae (38) 


where 2; is the inhibited rate and v the uninhibited rate from (3). 
If the inhibited and uninhibited rates from (12) and (4) are denoted by 
v; and v’ respectively, the equation analogous to (38) is 


Ov; eo Yr \—ifd vo 
eo Ut DAG a: (39) 


If the functions v(S, 2) and v'(S, I) are considered as one-parameter 
families of curves, with J as the parameter, the initial slopes of these curves 
are given, as functions of J, by (38) and (39). It is clear that values of c and 
K, can be found such that (38) and (39) agree over a restricted J-range but 
that no values of these constants can be found such that satisfactory agree- 
ment is secured over an extended J-range. For example, if ¢ is approxi- 
mately 1/K., then for J sufficiently small, e~@ is approximately (1 + 
I/K.)~“, but as I increases the functions e~? and (1 +J /K.) must 
diverge. 

It is safe to state that all known cases of strictly* competitive inhibition 


* Cases which are neither strictly competitive nor strictly non-competitive are possible 
and have been discussed (e.g., Botts and Morales, 1953; Green, 1953; Ebersole e¢ al., 1944). 
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are adequately described by (12). Equation (39) is a direct consequence of 
(12), in fact, the reciprocal of (39) is the slope of the familiar linear plot, 
1/v; vs. 1/S, obtained from (12). That this slope, as predicted from (12), 
is a linear function of J has been experimentally verified in many cases. 
By the argument of the above paragraph none of these cases could be satis- 
factorily fitted by the function v; = gP(S, Z) with P(S, I) from (33). 

It is worth noting here that the considerations of this section apply also 
to competitive substrates. For it is a mere technicality that some inhibi- 
tors are not acted upon by the enzyme or, more properly, are acted upon 
at a negligible rate. Many well-known inhibitors are broken down by the 
enzyme at an appreciable rate and are usually termed competitive sub- 
strates. If the “inhibitor” is acted upon by the enzyme then the rate of 
that reaction is proportional to Y(S, J). If p(S, ), from (29), is substi- 
tuted into (25) and the transform y(S, \) so obtained is inverted, the re- 
sult is 


Y (S, I) = Eyes fe # $y (2 VbS ck) dé. (40) 
0 
From (37) 
S a 
P(S, D = bEye# f eX, (2 bEcT) dé. (41) 
0 


From (40) and (41) it is clear that Y(S, J) is obtained from P(S, I) by 
interchanging S and J and 6 and. An alternative form for Y(S, J), easily 
verified by noting that the integral in (32) is Y(S, I) e’°/Eo, is therefore 

VY (S, 1) =Ey(1— e-%G (5, 11, i (42) 


where G(S, J) is obtained from F(S, J) by interchange everywhere in (32) 
of S and J and of 6 and c. 

A discussion, analogous to that given for P(S, I), can be given for 
Y(S, J). In particular, 


Y (0, J) = fl — e "1 (43) 
comparable to (2), and 
| Y(S, ©) =&, (44) 
Ce 04 (45) 
Y (S, I) < Y (0, I), given S, finite J, (46) 


comparable to (34), (35), and (36) respectively. Plainly, equations analo- 
gous to (38) and (39) and analogous discussion follow at once. ' 

V. Role of the ES-complex. In deriving (2) and the subsequent modifica- 
tions (10), (15), (33), and (42) we have, following Medwedew, neglected 
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the formation of the ES-complex. In (10) and (15) the formation of the 
El-complex was admitted and in (33) and (42) the formation of neither 
was admitted. As noted in Section II, if the formation of the ES-complex 
is to be admitted then its concentration, W, must be taken into account. 
There are several ways in which this can be done and clearly all of the 
equations so far derived could be rederived on this basis: However, it is 
proposed in this section to derive expressions for P(S) for an uninhibited 
system. 

The suggestion of Medwedew* regarding the value of W is that W is 
proportional to the substrate concentration. A second assumption is that 
the complex is formed independently of any collisions which are occurring 
but renders future collisions ineffective. An alternative assumption is that 
complex formation and collision are mutually exclusive events, i.e., com- 
plex formation occurs at the collision sites. In all cases 


g=E,—P—W (47) 
for, as noted, it is the contention of Medwedew that enzyme in the form of 
complex is inactive. There are three cases to be considered 

CaseI: W=kS, 
Case II: WK,= (¢+P)S, and 
Case III: WK,=¢S , 


corresponding respectively to Medwedew’s suggestion; the assumption 
that complex formation can occur with enzyme not already complexed; 
and the assumption that complex formation can occur with enzyme 
neither complexed nor engaged in collision. 

For Case I, (1) becomes 


dP 
qs t oP = 6 (Ey—kS), (48) 
the solution of which is 
P=(Eo+5) (1 = es) Ee (49) 


From (48) and (49), P exhibits an optimum value, P* = E, — kS*, where 
S* is the one real, positive solution of dP/dS = 0 from (49). But for suf- 


* Medwedew has invoked the formation of ES-complex to explain inhibition by excess 
substrate and certain experimental results on carboxylase. One of us (Katzman) has been un- 
able to repeat these experiments and, in fact, a rather simple chemical discussion (Katz- 
man, 1950) shows the results reported by Medwedew to be unreasonable. 
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ficiently large S, P becomes negative. From the definition of P this is a 
physically untenable situation. 
For Case II, (1) becomes 


dP _ bKEp 
OS teers tS? om 
the solution of which is straightforwardly obtained as 
Ss be 
Bea bk ine es [= 
oe feet: sale 


With the substitutions s = b(K, + §), wu = 0(K. +S) =a+05S, (51) 
becomes 
P= ake“ [€(u) — E(a)], (52) 


where €(z) is the tabulated (Lowan, 1940) function 
: cigs fae 
E(u) = f as. 
For sufficiently large u, P approaches zero as 1/u, since for large u 
1 re 2 


and the error in (53) is of the order of magnitude of the last term retained. 
From the initial conditions P(0) = 0, and therefore dP/dS = 0 for at 
least one point on the interval 0 < S < @. From (50), there is but one 
extremum for P, viz., P* = aE,/u*, where u* is the value of « at which 
dP/dS = 0. Thus P(S) increases from zero to P* as S increases from zero 
to S* and thereafter approaches zero as S increases. While P(S) remains 
positive for all S there is a basic objection to (52) and to (49): Tf (1) is 
accepted then it is impossible for dP/dS to become negative, for g is a 
concentration and it is physically necessary that ¢ > 0 for all S. The dif- 
ficulty is obvious enough for Case I in which W increases without limit 
with S. In Case IT it is easily shown that Ep) — W = aE)/u and, hence, the 
amount of available active enzyme decreases as 1/u for all values of u. 


For large u, P is [see (53)] 

ak (4+5+4+ ae Dy 
Thus for large u, 

pe — ob ata+ af oe 
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For Case III, (1) becomes 


dP, bKy p_ bKeko 


= (54) 
OS te Rae: Ky+S 


the solution of which is 


P(S) =Eo| 1- (485) |} (55) 


where a, necessarily positive, is defined, as before, as a = DK... From (55), 
P(S) is a strictly monotone increasing function-of S. Moreover, for all 
finite S, d?P /dS? < 0. Therefore, P(S) increases steadily from zero, at 
S = 0, to the limiting value Ey, as S — ~, and exhibits no extremum and 
no point of inflection. 

Comparison of (2) and (55) shows that the initial slopes are identical 
and have the value bE». Thus the curves (2) and (55) leave the origin with 
the same slope and approach the same asymptote, Zo. But for any finite 
K., the curve (55) lies below the curve (2) for all finite S > 0. This is 
easily seen from the fact that 


b w 
Gah REN kha De ee fn (56) 


where z = S/K,. When K,, = 0 the quantity (1 + 2)!" has the value 
unity, and as K, — ~ ands — 0 this quantity increases and approaches e 
as a limit. That (55) approaches (2) in the limit as K,, > © is intuitively 
obvious from the physical hypothesis of Case ITI. 

A comparison of (4) and the rate equation from (55) can be made as 
follows. The rate, v, for Case III is gP(S) with P(S) from (55). Consider 
the function v/V ». Denote this function as obtained from (55) by y and 
the corresponding function from (4) by We. Then 


aN 


Y2= E+S : (58) 
It is evident that if @ = 1 (57) and (58) are formally equivalent and, in 
fact, if (4) is derived on an equilibrium basis they are identical, for then 
K = Ky, is the dissociation constant for the ES-complex. It is certainly 
true that in a large number of cases the experimental data can be satis- 
factorily fitted by (58). The implication in all such cases is that a = 1, or 
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at least that the a-value is very close to unity. Although Medwedew does 
not prescribe the nature of 0 in (1), there is no a priori reason to assume 
that this parameter and the dissociation constant K,, are always related 
in such a way that a = bK,, = 1. On the contrary, accepting the physical 
basis of (57), it is reasonable to assume that when various enzyme and 
substrates are considered a considerable range of a-values will be encoun- 
tered. 

The ways of checking the validity of (58) are many and well known. A 
familiar one is to examine the plot of 1/y2 against 1/S. It is predicted by 
(58) that this plot is linear with slope 


=K. (59) 


The slope of the corresponding plot from (57) is 


1 
a 
nen s _ 
ee ay. ‘laces - co 
S 


Of course (59) and (60) agree if a = 1, but if a ¥ 1 the slope of the 1/y1 
vs. 1/S curve is S-dependent. For small S, i.e., S « Kw, (60) is given up to 
first-order terms in S/K,, by 


v1 _ Ke 


[1-(@-v 2], (61) 


and for large S, i.e., S >> Ku, (60) becomes 


1 

a ta 
Mh (62) 
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As S decreases, (60) approaches 1/0; as S increases, (60) approaches zero if 
a > 1 and increases without limit if a < 1. Therefore the iy M1 curve inter- 
cepts the ordinate axis at 1 / = 1 with zero slope or infinite slope (de- 
pending on whether a > 1 ora << 1) and as 1/S increases the curve ap- 
proaches linearity and takes on slope 1/b. If a = 1 the slope is everywhere 


1/b = Kw. 
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We denote by 5, and S» the S-values for which y; and y2 have the value 
4; then 
Sy Kol tush; (63) 


S,=K. (64) 


Consider the curves obtained by plotting y and y2 against In S. The slopes 
of these curves at In S; and In So, respectively, are 


bya na Fe 

(55 in SJ 3 \diaes |. £3) 

ca ee (66) 
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Equation (58) predicts that the slope of the y2 vs. In S curve, evaluated at 
W. = 4, In S = In Ss, has the value { and, further, this value obtains for 
any system,™ regardless of the chemical nature of enzyme and substrate, 
provided the mechanism is correctly portrayed by E+ S=ES— E+ 
product. On the other hand, (57) predicts that the slope of the Y2 vs. In S 
curve, evaluated at Y2 = 4, In S = In Sy, depends upon the value of a, and 
if a varies from system to system the observed slope will likewise vary. 
A comparative summary of the properties of (4) and (55) is given in 
Table I. 

VI. Discussion and summary. It has been shown that provided an in- 
hibitor acts via the formation of an inactive enzyme-inhibitor complex the 
theory of Medwedew and the classical enzyme-substrate complex theory 
predict essentially the same results: The presence of a competitive in- 
hibitor does not effect the maximum attainable rate (V ,) but increases the 
“apparent Michaelis constant,” or the value of the substrate concentra- 
tion at which one-half the maximum rate is attained [cf. (5) and (6)], and 
this increase is linear in the inhibitor concentration [eq. (11)]. The pres- 
ence of a non-competitive inhibitor does not effect the ‘apparent Michae- 
lis constant” but diminishes the maximum rate in such a way that the 
ratio of uninhibited to inhibited rate is a linear function of the inhibitor 
concentration [eq. (16)]. Therefore, if the Michaelis-Menten equation (4) 
and the Medwedew equation (3) (which, be it noted, neglects the occur- 
rence of the enzyme substrate complex) give approximately the same re- 


* This is in fact the well-known method for the evaluation of » in the model E + 2S= 
ES, — E + products, for which (dy/d In S)s,/, = 1/4. If m > 1, the rate-substrate curve 
exhibits an initial slope of zero and an inflection point at » = Vn/2. Such cases are thus 


easily distinguished from the case m = 1 or from Case III which leads to (55). (See e.g., Gold- 
stein, 1944.) 
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sults, no further discrimination between the two equations can be gained 
by studying inhibitors of the assumed type. 

But there is little logical cogency to a theory which requires a given 
mode of enzyme-substrate interaction and a different mode of enzyme- 
inhibitor interaction. For a substance which diminishes the rate at which 
an enzyme acts upon a given substrate may itself be a substrate. If the 
substance in question is acted upon by the enzyme at a negligibly small 
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rate, we have the classical case of an “inhibitor.” But this is a limiting case 
and any adequate theory of enzyme action must be able to care for the 
influence of one substrate upon the rate of reaction of another, viz., the 
cases of competitive and non-competitive substrates which include, as 
special cases, the classical cases of inhibition when the rate of reaction of 
one substrate is negligible. Such a theory, plainly, must assign one mode of 
interaction to both substrates, i.e., either collision or complex formation. 
The case of competitive substrates leads to rather complex rate expres- 
sions [proportional to (33) and (42) or to (40) and (41)] and while a general 
functional analysis shows that these expressions exhibit qualitatively the 
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main features of competitive inhibition, a detailed discussion of the cor- 
responding fitting problem (which is here difficult, if possible) has not 
been given. What has been achieved is an expression giving the degree to 
which the initial slope of the rate-substrate curve is depressed by a given 
concentration of inhibitor [eq. (38), cf. eq. (39)]. Further, this expression 
[eq. (39)], as obtained from the enzyme-substrate complex theory, is a 
direct consequence of a well-known property of the plot of reciprocal rate 
against the reciprocal substrate concentration and cannot be made to 
agree with the corresponding expression as obtained from the Medwedew 
theory. Equation (39) may be considered to have been verified in many 
cases from the reciprocal plot mentioned above and in a large number of 
additional cases by the fact that, for sufficiently large S(S > K), the 
degree of inhibition is a function only of the ratio of the substrate to 
inhibitor concentration. No such property can be deduced from (33) or 
(41). It should be mentioned that in many studies the only facts verified 
from the reciprocal plots are that the intercept (1/V,) is independent of 
the inhibitor concentration and that the slope is an increasing function of 
the inhibitor concentration. The ES-complex theory demands in particu- 
lar that the slope, K(1 + I/K.)/Vm, is a linear function of I. 

The case of non-competitive substrates has not been discussed in detail. 
However, the situation is quite simple: If two substrates, S; and So, do not 
mutually interfere in so far as collisions are concerned, then P; and Pg, the 
concentrations of enzyme engaged in collision with S; and S:, are deter- 
mined from (1) in the manner in which (2) was derived. That is, 


Prekj {i044 ) Swe ies 


But if S, collisions are ineffective if the enzyme is simultaneously S>- 
engaged, the rate of reaction for S; is proportional to the concentration of 
enzyme engaged in 5}-collision but not engaged in S»-collision. The prob- 
ability of S,-engagement is P,/E» and the probability of no S.-engage- 
ment is 1 — P2/E. Therefore, the rate of reaction for S; is proportional 
to* 

Eyes [1 — e~%sSi] , 


(with a similar expression for the S,-rate) and is to be compared to (17) 
with the understanding that S2 plays the role of the inhibitor, 7. Again the 


* This result and the corresponding result for the rate of reaction for S: can be obtained at 
once in a logically satisfactory manner by solving the system: dai/dS; = biao, da2/dS, = bed, 
das2/dS; = bide, day2/dSz = b3m, where ao+ a + a2 + ae = 1 and ao, a1, a2, diz are the frac- 
tions of the enzyme which are free, Si-engaged, Se-engaged, 5i- and Ss-engaged respectively. 
The assumptions b; = 6} and b, = b express the assumption of no mutual interference. 
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problem is that of securing agreement between an exponential function, 
e~”*:, and a reciprocal linear function, (1 + I/K,)—, with I = Sp. It is 
worth noting here that the results of J. H. Northrop (1922), which were 
assumed to present difficulties for the ES-complex theory, can be ex- 
plained at once if it is assumed that the trypsin substrates gelatin and 
casein are non-competitive, non-interfering substrates. If it is insisted that 
they are competitive, then the formulation of Section V would apply and 
the Medwedew theory would describe the results no better than the ES- 
complex theory. The case of non-competitive, non-interfering substrates is 
trivial: one mol of enzyme behaves as a mixture of one mol of .S;-specific 
enzyme and one mol of S»-specific enzyme. 

Perhaps the most serious defect of the Medwedew theory is the fact 
that equation (3), the only one which he applied to experimental data, was 
derived from (1) on the assumption that, at all substrate concentrations, 
the concentration of the ES-complex is negligible. But the general qualita- 
tive theory provides for the formation of the complex and, indeed, as 
noted in Section II, assigns it the definite role of binding enzyme in in- 
active form. Of the three physical hypotheses (Cases I-III, Section V) 
regarding the manner of formation of the complex, only one proves to be 
physically tenable. That hypothesis (Case III, Section V) leads to an 
equation, (55), which can be made to agree exactly with the so-called 
Michaelis-Menten equation provided the proportionality constant ), in 
(1), and the dissociation constant, K.,, for the ES-complex are so related 
that, for all enzyme-substrate systems considered and under all conditions 
(in particular, at all temperatures), it is true that 6K, = 1. If this condi- 
tion is not met, and it is physically unreasonable to assume that it will be, 
then the Medwedew equation and the Michaelis-Menten equation dis- 
agree in a manner which has been discussed in detail in Section V. In par- 
ticular, if bK,, ¥ 1, the slope of the reciprocal rate against reciprocal sub- 
strate concentration curve, as obtained from the Medwedew equation, is 
not constant and if bK,, is very different from unity it is to be expected 
[eqs. (60), (61), and (62)] that the departure from linearity of the recipro- 
cal plot will be grossly detectable. The ES-complex theory predicts that 
the slope of the curve obtained by plotting v/V m against the logarithm of 
the substrate concentration has the value when evaluated at the point at 
which v/V m is 3. This value is independent of the nature of the enzyme 
and substrate, provided the system obeys the assumed mechanism and 
stoichiometry (see footnote, p. 272), and is independent of the tempera- 
ture. As noted, this is a commonly tested criterion and has been used to 
determine the stoichiometry of enzyme-substrate and enzyme-inhibitor 
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reactions (cf. Goldstein, 1944). The only way in which the Medwedew 
equation can reproduce this property [cf. eq. (65) or item 5 in Table I] is 
for the condition bK,, = 1 to be satisfied independently of the chemical 
nature of the system and of temperature. It is known that most, if not all, 
dissociation constants increase with temperature. It is unreasonable to 
assume that 6 decreases with temperature increase in precisely such a way 
that 6K, is temperature independent and moreover always has the value 
unity. There are in Table I many checkable items, e.g., the temperature 
dependence of $1, (cf. references p. 509, Hearon, 1952), ratio of initial 
slope to slope at Si/2 from items 3 and 4, etc. 

It cannot be said that the Michaelis-Menten equation, or any other 
formal relation, has ever been absolutely validated. Tests of goodness of 
fit and statistical analyses are sometimes (Lineweaver et al., 1934; Eisen- 
hart and Wilson, 1943 and references therein; references in Wilson, 1949; 
Ebersole et al., 1944), but not often performed, and departures from the 
ES-theory, apparent or real, have not been checked with the formulations 
of this paper in mind. However, it appears reasonably clear that the 
Medwedew theory (unless very stringent assumptions are made regarding 
the parameters in the Medwedew equation and their temperature depend- 
ence, about which the Medwedew theory says nothing) will lead to ob- 
vious and grossly detectable departures from the ES-enzyme theory. 

It is not the opinion of the authors that the ES-substrate theory is 
inadequate. But surely the model E + S = ES —> E + products and the 
corresponding rate equation (4) does not describe every enzyme system 
and attempts to force all enzyme kinetics into this framework were 
doomed, as modern biochemistry has shown. While many complex sys- 
tems (involving multiple complexes, carriers, prosthetic groups, etc.) are 
formally described by (4) (Hearon, 1952) this need not be the case. Indeed 
the primary property, exhibited by (4)—that the rate is proportional to 
the total enzyme concentration—is not necessarily retained by more com- 
plex systems, but the precise conditions under which this property is ex- 
hibited have been set forth (Hearon, 1952) (cf. in this connection Molwyn- 
Hughes’ chain-reaction theory and Haldane’s critique thereof, discussed 
by Wilson, 1949). Therefore radical departure from the simple equation 
(4) of Henri-Michaelis-Menten-Briggs-Haldane does not necessarily in- 
fringe the physical hypothesis of the obligatory formation of one or more 
enzyme complexes and perfect obedience to ‘Michaelis-Menten kinetics” 
does not, alone, prove the operation of the simple model on which this 
equation was first based. 


The ES-complex theory has been invalidated in no particular case and 
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has been directly validated in some special cases. Modern rate theory, it is 
generally assumed, has set classical collision theory in proper perspective. 
Still any alternative theory deserves an answer. Medwedew’s theory has 


heretofore been completely ignored in some quarters and uncritically 
accepted in others. 
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This paper is based on N. Rashevsky’s theory of imitative behavior, the underlying idea 
being that performance of one reaction by a given individual produces an increased stimula- 
tion (or tendency) toward the same reaction in other individuals. For simplicity, consideration 
is limited to cases in which each individual may choose only between two (or two main cate- 
gories of) reactions, denoted by A and B in the following. However, upon suggestion from 
Dr. Rashevsky and certainly in better agreement with actual facts, the strength of imitative 
interaction is assumed to vary from individual to individual. More precisely, if ¥; denotes the 
additional excitation caused by imitation in the ith individual, P.4; the probability for perform- 
ance of reaction A, and Pz; the probability for performance of reaction B by the ith individu- 
al, we postulate that 


f= a Pa— >) ByPe,— ah, G=1,2,...N), 


oa os 
k#i k#i 


where the constants a}, and 8% (coefficients of imitative interaction) measure the amount of 
imitative influence exerted by the Ath individual upon the ith, WV being the total number of 
individuals in the population. The term —a;y; accounts for the spontaneous decay of excita- 
tion, and the quantities aj, and 6%, (i ~ k) are assumed to be non-negative. The expressions 
for Pa; and Pz; are obtained from H. D. Landahl’s theory of conflicting stimuli; they depend 
non-linearly on the values yi. 

It is implicit in this formulation that the theory can only be applied if the frequency of con- 
tacts between individuals is not too small. Some further shortcomings and limitations of the 
model are outlined, and the discussion includes suggestions for reinterpretation and improve- 
ment of the theory. 

If all the quantities af, and 6% have the same value, say A, we return to the case treated by 
Rashevsky (and Landau, 1950); these authors, however, replace the sums in the equation above 
by integrals, which automatically restricts the validity of their results to very large values of NV. 
Their work may therefore be characterized by the assumption of uniform interaction in large 
populations. Our equations, on the other hand, are applicable even to very small groups, and 
therein lies one of their main advantages. ; 

In this paper the mathematical properties of the non-linear system of equations above are 
studied with particular reference to the existence and stability of steady states [dy,;/di = 0, 
; ee for the existence of only one stable steady state is derived. Tt may be 
formulated roughly by stating that all the coefficients of interaction should be sufficiently small. 
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If that is not the case, there may exist a greater number of stationary states. In particular, two 
of them (called “extremal’’) have the following properties: they are stable and such that the 
average number of individuals in the group performing one or the other reaction is the largest 
(or smallest) possible as compared with the other steady states. Hence the situation is qualita- 
tively similar to that found by Rashevsky and Landau. Quantitatively, however, important dif- 
ferences may arise, depending on the nature of the matrix specifying the interaction. 

A stable state may be approached through damped oscillations, but this effect is important 
only if the damping is sufficiently small for the oscillations to become practically observable. 
Little information could be obtained on this point, due to mathematical difficulties. 

As mentioned above, the most interesting applications of this theory will be with respect to 
small populations or to populations partitioned into subgroups with varying amounts of imi- 
tative interaction within as well as between groups. 


I. Introduction. N. Rashevsky (1951, chaps. xi, xii, xiii) has suggested 
a simple model to describe how the imitation between individuals affects 
the behavior of a social group. 

We shall assume that the reader is familiar with his work, and also 
with a paper on the same subject by H. G. Landau (1950). 

Underlying Rashevsky’s model is Landahl’s theory of discrimination 
between two conflicting stimuli (cf., e.g., Rashevsky, Joc. cit., chaps. ii, 
iii, iv). Let one individual, say the 7th individual in a group of N, be con- 
fronted with the choice between two mutually exclusive reactions (‘‘reac- 
tion A” and “reaction B”’) to a given external or internal stimulus. Then 
a quantity ®; is introduced which, so to speak, measures the individual’s 
tendency toward performing either reaction. More precisely, it is assumed 
that ®, consists of a constant part &;) and a term ©; describing the fluctu- 
tions of ©; around its mean value ®;. Let p;:(@)d®{ be the (normalized) 
probability distribution of the quantity ®{, and call P4; the probability 
that the zth individual should perform reaction A in the absence of imi- 
tation (“intrinsic” probability of reaction A). Landahl’s theory then 
states that reaction A will occur if ®; > 0 (ie., By + &{ > 0), and re- 
action B if :) + &, < 0.* Hence we have 


Lae 7 1s (®') de’ = u, (@,,) ’ (1) 


and similarly for the probability of reaction B 
Ps; = 1 — u(y). (2) 


In this model, then, each individual is characterized by a certain value 
of ®,o, and by a function ,(®). Rashevsky (loc. cit.) and Landau (loc. cit.) 


* For simplicity, we are assuming that the “threshold” h entering the theory of conflicting 
stimuli should be = 0 (or practically so). See Rashevsky (Joc. cit.) for details. 
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use a distribution function for the quantity ®o (instead of the discrete 
values ®,9). On the assumption that the total number N of individuals in 
the population is very large, they introduce a function which in our nota- 
tion should be called (#0), such that 2(#o)d®) denotes the fraction of 
individuals having a given ®p (within d®y). This constitutes an unneces- 
sary limitation to large values of N, and in fact we shall find it more con- 
venient not to follow their procedure. Hence, wherever an expression of 
the form 


N- ff Fi.) 2 (Gs) dbs 


appears in the papers of Rashevsky and Landau, we shall have instead 
simply a sum, namely, 


SS F (®jo) . 


Our formulation automatically implies that the reactions A and B 
should be performed sufficiently often by each individual, so that it be- 
comes natural to equate probability and actual frequency. Thus, if on the 
average the 7th individual performs reaction A x4; times, and reaction B 
kp, times per day (or any other unit of time), we can write 


Ka; = pila; » (3) 
kp; = piPs;, (4) 

where 
Pe TK As aE KB; - (5) 


For a mathematical treatment of the opposite extreme (imitation and 
spreading of single acts in a large population) cf. a paper by Rapoport 
(1952). pre 

The probability with which reaction A occurs im the whole population is 


N 


so Ka; dete Pa; 


Pa —_——___= (6) 


DS (act a) a 


but, since some individuals will be performing that reaction more fre- 
quently than others, the average number X of individuals performing re- 


action A will be given by 
N 
Xe N°P.,= soe rd (7) 
i=1 
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The average number of those performing reaction B will be 
Yo ees (8) 


In this general scheme the imitation between individuals is introduced 
as follows: The performance of a reaction (A or B) by one individual is 
supposed to cause an additional excitation toward that same reaction in 
other individuals, that is, an increment to the quantities @;. The latter 
are now replaced by 


Ai(t) = Bio + Y(t) (9) 


where the y,’s, which are introduced ad hoc solely to describe the effects 
of imitation, may be called the “imitative excitations.” 

The probabilities P,; and Pz; given by (1) and (2) are likewise replaced 
by u[A-(¢)] and 1— w,{d;(¢)], and the quantities (7) and (8) now become 


X= Dw), (10) 
Y(t) =N-X(t). (11) 


The basic assumption in the theory concerns the dependence of y; on #. 
Rashevsky and Landau put dy;/dt equal to the sum of two terms: 

a) a decay factor —ay; and 

b) a quantity proportional to (X — FY). 
Thus their formula becomes (in our notation): 


== A(X —Y) — ay. (12) 


The constants A and a in (12) do not depend on i, i.e., they are the same 
for all individuals; in fact, the quantity y; itself is implicitly assumed to 
be the same for all individuals and can thus at best represent an average 
value over the whole population. 

It is this latter restriction which we shall remove in this paper. Specifi- 
cally, we suppose that the performance of reaction A or B by other in- 
dividuals of the population gives rise in each individual to an additional 
excitation y(t) whose time variation shall be described by 

ab N N 
Fen Gem Dy Vata Dy sean= obey (13) 


where the y{,’s and 5;;’s are non-negative constants which may be called 
the coefficients of imitative interaction or of mutual influence. We shall 
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also put ¥;; = 5;;= 0 (@ = 1,2, .. . W), expressing the fact that an indi- 
vidual would not imitate himself.* The y,’s and 8/,’s need not be sym- 
metric (i.e., yj, may be different from +{,, and similarly for 6;,), since the 
influence of the 7th individual on the &th may be, and usually is, different 
from that of the &th upon the 7th. All the decay constants a; shall, of 
course, be positive, but not necessarily equal to each other. 

In equation (13) we replace x4, and xg, by their values pxux(Ax(£)) and 
prll — uz(Ax(t))] according to equations (3) and (4). Since we have no 
formula at our disposal indicating how the frequencies p; should vary 
with time, we shall also replace the p;’s by their average values px; this 
will be permissible as a first very rough approximation if the quantities 
px (k = 1, 2,...N) are sufficiently large (which we have already as- 
sumed), and if, moreover, their changes due to imitation effects are rela- 
tively small. Equation (13) then becomes 


ds; WQ_, ¥ 
Fe Dy Yah OO) — D7 A044 OO 49,0. (14) 
=1 k=1 


It shows that, apart from the decay term —a,;, the time rate of change 
of a given individual’s y; is made up by contributions stemming from 
each of the other individuals. The effect of the &th individual on dy,/ 
dt consists in turn of two terms: pxyiite(Ac(¢)) and —p.bi{1 — a(de()], 
so that the intrinsic ®; will tend to be zmcreased by an amount related 
to the &th individual’s probability for reaction A, and, on the other hand, 
to be decreased by an amount depending on the probability for reaction B 
of that same &th individual. It should be clear from our previous remarks 
that the differential coefficient dy;/dt must be understood to mean the 
change of y; in a time interval which is of the same order of magnitude as 
the basic unit of time used in determining the frequencies k,; and kz;. 
Formula (14) reduces to (12) if we put pry = p.0;, = A anda; =a. 
So far, and in strict accordance with Rashevsky’s original formulation, 
we have postulated that only the actual performance of the reaction 
should constitute a stimulus for imitation. But it is of interest for possible 
applications of this theory to consider some more general cases (cf. also 
the discussion in Section V). There may be some after-effects or changes 
connected with a given reaction, which by themselves will elicit that same 
* In a recent work (unpublished) Dr. G. Karlsson postulates that the coefficient of influence 
of one individual on himself should not vanish; instead, it is assumed to indicate the effect of 
past actions on the future actions of the individual himself. We do not introduce a similar hy- 
pothesis, since we are dealing strictly with phenomena of imitation in the usual sense of the 


word. It must also be remembered that, as far as a comparison can Deca Ces at all, Dr. 
Karlsson’s coefficients include both our ®i0’s as well as our vi’s and 64's. 
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reaction in other individuals. Or, to put it differently, as a result of having 
performed reaction A, the ith individual is left in “state A” (as we shall 
say for want of a better terminology), and this latter fact alone may excite 
others as a stimulus for reaction A. Let us give a simple example: a man 
may decide to put on a light coat (reaction A) or a heavy coat (reaction 
B); meeting him with his light coat on (i.e., in ‘‘state A”) may be suffi- 
cient stimulation for someone else to perform reaction A. Clearly this less 
direct type of mutual interaction will appropriately be called imitation, 
since the result is again the performance of one of the two reactions by 
other individuals. We shall therefore improve our equation (14) by add- 
ing to its right side the terms 


N 
pa {v{Oa,— 84 11-Qa,]} , 

=i 
where the y‘;’s and 6,;’s are positive constants and Q4, denotes the 
probability of finding the Ath individual in state A. This latter proba- 
bility will be given by Tia/(Tm + Tiz), where Ti, and Ty are the total 
lengths of time that the &th individual spends in state A and in state B 
respectively. On the other hand, since “state A” is the direct result of 
having performed reaction A, it is natural to assume some relation be- 
tween Pa, and Qa;. As the simplest such relation we shall postulate 


Op=— Pi, (hai? Wy: (15) 


If T4, (and Tp,) denote the average length of time that the &th indi- 
vidual spends in state A (or B) each time after having performed reaction 
A (or B), equation (15) implies essentially that T,, and Ty, should be 
equal (or at least approximately so). Nevertheless, in many cases it will 
not be too bad an assumption, since, in general, we may expect to find an 
individual more often in state A than in state B, if he performs reaction A 
more frequently than reaction B. 

It should also be mentioned that in more complicated cases the perform- 
ance of reaction A might extend over a certain finite period of time and 
actually involve a sequence of actions, each of which would have a definite 
stimulus value toward reaction A for other individuals. Finally, the inten- 
sity of reaction, which we have completely neglected, should also be con- 
sidered in a more complete treatment of our problem. For simplicity, how- 
ever, we shall limit ourselves to the cases discussed above. 

Equation (14) thus becomes: 


ay at 
== De Mit, Oy ()) = > Bis [1—w, (A, ())] —ay,() (16) 
=1 
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with 
Qik = prvi t+ VK, Ghee 
Bix = prdin + 8%, (18) 
an >O0 and 6B, >0 forixk, af, = Bi,=0, (19) 
a;>0. (20) 


Before proceeding any further it is important to point out some other 
restrictions to which our model is still subjected: 

1. We have assumed that only two reactions, A and B, are involved. It 
would, of course, be much more realistic to devise a model in which each 
individual might choose between a greater number, say r > 2, of reac- 
GOSS © Pig: PCa A,). At the end of this paper we shall indicate a 
possible way of approaching this more general problem. Nevertheless, 
quite often we may be particularly interested in one of the reactions (e.g., 
A;) and lump the remaining r — 1 of them together as “‘reaction B.”’ If 
that is the case, the situation could again be treated by means of our 
present simple assumptions. 

2. In equation (13) we have written dy,/di = dd;/dt, implying that 
the ®j’s are constant. Rashevsky (Joc. cit., chap. xii) and Landau 
(Joc. cit.) also postulate that their distribution function (#9) should not 
vary with time. This is a simplifying assumption which may be justified 
as a first approximation when imitation effects play the primary role, and 
it is indeed only to such cases that our model applies. The constants ®;o, 
a.,,, and 3”, may depend on other parameters characterizing the population 
(such as social status or geographical distance*), provided that these 
latter quantities, too, be considered as practically constant if compared 
to the y,’s. Rashevsky (loc. cit., chap. xxiv) and, very recently, Landahl 
(1953) have given simple treatments of cases in which the time variation 
of other quantities besides the imitative excitations y; is of importance. 

Once the functions \,(¢) are known from equations (16), the numbers X 
and Y of individuals performing one or the other reaction can be obtained 
from (10) and (11). This paper will, therefore, be devoted to studying the 
mathematical properties of the non-linear system of differential equations 
(16) which are of the form 

OM GAOT SK) 
where the f;’s do not depend explicitly on /. 

* Compare in this connection a paper by Rashevsky (1953) on imitation effects in non- 
uniformly spatially distributed populations. 
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It is well known that the theory of such systems has been developed 
by many workers, and that two main problems deserve special attention: 

a. The asymptotic behavior of the functions ),(¢) as ¢— ~. Closely 
connected with it is the existence of stationary states for which all the X,’s 
and, therefore, also X and Y are constant. 

b. The stability properties of the stationary states. 

In this paper we investigate particularly the second of these problems, 
since no complete results on the asymptotic behavior of our functions 
.(¢) can be obtained. 

As in the simpler case treated by Rashevsky and Landau, we shall be 
mainly interested in the possibility of a transition from one stable value 
of X (and Y) to a different stable value if the parameters a,,, 6/,, and a; 
undergo some changes due to variations in the external conditions. 


IT, Some preliminary results. Putting 


i, = 6, + By (a,,= 0), (21) 
Nn 

b= 69,— >) Ba: (22) 
k=1 


our equations (16) become 
dhi_ 

aye > Qigte (Ax (4) ) — a;r; (4) +.3;. (23) 
k=1 


If now we introduce instead of the \,’s the new functions 


rth oops (24) 


a; 


the system (23) is further transformed into 


df; % 
a= — aif (!) cht oR (Ke) , (25) 
where 
c4= = 0 if ik, 
‘ (26) 
Ci 0, 
and 


v(t) =u (6454), (27) 
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or, also, 
2(f) =f 9,2) ae (28) 
with 
s/ sof b; 
a.(¢) =P, (— 0%). (29) 


The distribution functions p,(‘) [or ¢;(¢/)] are assumed to be continu- 
ous, non-negative, and such that 


JE ee 


If only two reactions, A and B, are involved, it is natural to assume 
that p:(@;) is a simple Gaussian; the position of its maximum may or 
may not coincide with the value ®,9 (distribution function with or without 
bias). If, on the other hand, “reaction B” includes several possibilities 
A», ...A,, as discussed in the previous section, it may still be possible 
to reduce this case formally to the simple model of two conflicting stimuli 
(this was suggested by Landahl in 1941), but with more complicated ex- 
pressions for the functions ,(6;). However, we have not investigated 
this problem any further. 

We shall assume quite generally that v,(¢;) can be represented as a 
power series in the neighborhood of any point: 


vfs hs) = v(f3) + hige(fs) +... OF) (30) 


for all values of ¢; and for /; sufficiently small. 

The function 2,;(¢;) (¢@ = 1, 2,...N) is >0 and <1 for all values of 
its argument; it is also monotonically non-decreasing. Thus we have 
(Fig. 1b): 

v(—-~7)=0, 2(+e)=1, (Sl) 


v(x) >(y) if «>y. (32) 


Furthermore, 2;(¢;) satisfies the Lipschitz condition 
|o:(x) — 2(y)| < Bilx — y|. (33) 


In fact, denoting by 1/S; the maximum value of q; (see Fig. 1a), relation 
(33) holds for any pair of values « and y if we put B; = 1/5;.* 

We shall obtain a formal solution of our system of equations (25) by 
using the method of successive approximations, originally devised by 
E. Picard (cf., e.g., 1925, page 368). However, it was pointed out by E. 


* We are of course assuming that the quantities 1/5; are finite. 


288 ERNESTO TRUCCO 


Cotton (1910, 1911, 1928; cf. in particular also Bellman, 1947, pp. 363- 
64) that it is often more convenient, instead of starting directly from the 
differential equations, to transform them first into an equivalent system 
of integral equations which in our case are as follows: 


t N 
&: () = ce-%* + 4a,6-% (EC D2 Tine ($7) ) dr, (34) 


61 (0) = ¢; bay 25 ew es 


q(S;) 


3; 
Fic. 1a, b 


It is easy to see from (34) that the quantities ¢,(#) are bounded as 
t— o, For, putting 


N 


Dd cun=Ti>0, (35) 


k=1 
we have, since 0 < »% < 1: 


t 
OSS) — cesitS atest ecitd t 
T=0 (36) 


=T, (1 — e—%#) . 
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We can draw two further conclusions from (36): 
1. The asymptotic mean value of d¢,(t)/dt (defined as 


lim — : oss 
T>0 (T—T ) rT, a 


where Jo is any constant >0) is always zero. In fact: 


. 1 ? dé; (2) : 1 

] fie” = ae ee ————— = ————— . — ® — 
om oT) miei di= lim (T—T) [¢:(7) — §: (To) ] =0 
since {;(T) is bounded. 


2. As shown by Volterra (1931), if all the ¢;(¢)’s tend to a definite limit 
for t—> ©, we must have 


Optra 7201 2 eh, eV (37) 


Volterra’s argument is as follows: It can be seen directly from the original 
equations (25) that if lim ¢.(t) exists (¢ = 1, 2,...), then also lim 
d§ ;/dt will exist. Call this latter limit Z; and assume L; + 0. For ¢ larger 
than some appropriately chosen value 4 we shall have: 


ati oe 
dt oe 


and, therefore, 

6&(@ — §: (4) = L£;- ¢@—-4) for ti>h, 
which cannot be true, since ¢;(#) is bounded. (See also Bellman, 1953, 
Lemma 1, page 77.) 


III. The critical points; the solution of the integral equations (34). In 
general, given the system 


fase feet et lly Desai) et 


any particular solution ({1(é), f2(é),... ¢v(@) may be thought of as 
representing a curve in N-dimensional space (so-called characteristic 


curve). A set of values (m, 72, .. - nv) which satisfy the equations 
film, 72,.--1w) = 0 (fen D2. SV) (39) 


is called a point-characteristic or a critical point. In our case the equa- 
tions (39) become 


N 
n= ae Tine (Mk) « (40) 


k=1 
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It is well known (see, e.g., Lefschetz, 1946, page 40) that the system 
(38) can reach a critical point only at > ©, provided, of course, that the 
initial values (ci, co, . . . ¢v) did not happen to be a solution of (39). 

It is, however, not necessarily true that the system always tends to a 
steady state for {> ©. Thus, for dynamical systems, Birkhoff (1926) 
points out that there exists a set of “central motions” having the property 
that any motion approaches asymptotically one of the central motions. 
However, Birkhoff’s considerations give little indication as to the nature 
of the central motions themselves (for two-dimensional dynamical sys- 
tems the “central motions” are the constant and the periodic ones). On 
the other hand, Volterra (1931, 1935) in studying a simple non-linear 
system of type (38) finds that its solutions remain bounded and oscillate 
indefinitely as t > ~, without ever reaching a limit. This possibility can- 
not be ruled out for our system (25), even though we shall see that it will 
not occur for certain sets of initial values c;. 

Let us first of all investigate the equations (40). We shall try to solve 
them by iteration. Starting from some appropriate set of initial values 
(ni, 9, . . . nh?) we put 


N 
(y) (0) 
ee Sy) oie te (ne ) ’ 

k=1 


and, in general, 
Pls (n) 
n 
Ni = SS cin Oe (me) . (41) 
k=1 


If the sequence »{” converges to some value n{°, then nf (i = 1, 2, 
... WV) will clearly be a solution of the system (40). 
For N = 2 the equations (40) reduce to 


m1 = o1202(n2) , (42) 
n2 = oatr(m) , (43) 


and can be solved graphically by plotting the curves (42) and (43), as 
shown in Figure 2. The first few steps of the iteration 


n{t) = ar2v2(n$") , 


(el) 
2 


n o2101(n{")) 


are also shown in Figure 2, with 9 = o12, 7 = on. 


Some interesting general conclusions can be reached from this simple 
case V = 2. In Figure 3 we illustrate a situation for which the equations 
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(42) and (43) have three solutions, represented by the points A, B, and C. 
A little reflection will show that no matter how we choose the initial set 
(ni, n§), the sequence (m{”, n{) will never converge to the point R 
(unless the initial point should coincide with B itself). Furthermore, if 
n\® and n{° are taken as indicated in Figure 3, the sequence does not 
converge at all, but has instead the two points of accumulation A and C. 

However, returning now to the general case NV > 2, we see that the 


Pa uy ( 1, I> 


_ 


M ee Fae 


Fic.§2 


successive approximations (41) will certainly form a convergent sequence 


if we take 
N 
(0) _ = = tet) (44) 
= opr =k (k ihe ‘ee 


(as we have done in Figure 2). For then, since v(I',) < 1, we have 


Me ¥ eco) 
Ue > Cik= Ni 
k=1 
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and, in general, using (32), it follows from 
ne <a) Cork = 1, 2,2. .N) 
that 
qo) <r MAOL er al, cee 


Thus the sequence 7” is monotonically non-increasing. Since it is obvious- 
ly also bounded (0 < n” < TI), it must converge to a solution of (40), 
sayinh”) = Mie 12a). 


Ms 


N2=%, ila sar 


= Ge %l%2) 


Fie. 3 


Assume now that a solution of (40) different from M; exists; call it é; 
(¢ = 1, 2,... MN). Then, since 


&<T,= nf) (k=1,2,...) 
*Tt is clear that if instead of (44) we had chosen any set of initial values n® such that 
Mx < nf” (hte 1, 2s wn lV) 


the resulting sequence would still have converged to M;, though, in general, not monotonically. 
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we get by induction, using (32), 


fc yy Gee ly 2 ee) (45) 
for all values of x. We therefore conclude from (45): 
Soe el Ch iy Bea (46) 


Now, equation (10) can be rewritten 


N 
X= DF 0: (85) (47) 
i=1 
and, putting ¢; = M;, we see from (46) that the point (My, Mo, ... My) 
gives thelargest possible steady-state value of X (and thesmallest one of Y). 
If, on the other hand, instead of (44) we choose 


no =0 (k=1,2,...N) (48) 


the iteration (41) [with the initial values (48)] will converge to a second 
solution of (40), say n‘°? = m; (i = 1, 2,... NW), because the 7””’s now 
form a monotonically increasing and Potted sequence. In the same way 
as above we see that the m,’s give the smallest steady-state value of X. 

We shall call the two points (7m, m2, ... my) and (Mi, Mo,... My) 
thus obtained by our iteration procedure the ‘‘extremal solutions” of the 
system of equations (40). They may coincide (as in Figure 2); in fact, it 
is easy to give a condition which is sufficient to insure that such will be 
the case. Recalling the Lipschitz condition (33) and the meaning of the 
quantities 1/S;, we may state: If the parameters ay and a, are such that 


N 
as! for. ¢mlf2sn, oN: (49) 


the system (40) has only one solution. For assume that £; and ; (¢ = 1, 2, 
. NV) were two distinct solutions of (40) and denote by |£— »;| > 0 
the largest of the differences |£; — 7:|. Then we have: 


lé;— 31 S >> oy! (Ee) — %(m) |S mo Si | £. — ne | 
k=l = 


1, 


lA 
er 
| 
= 
3 Ms 


which contradicts our hypothesis (49). 
The conditions (49) may be compared to Landau’s (oc. cit.) inequality 
(16) (or to a similar inequality given by Rashevsky, /oc. cit. , chap. xii). 
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We recall that our a,,’s are analogous to Landau’s quantity A, our a,’s to 
his a, and our S,’s to his s. Landau’s relation (16) states essentially that 
his basic equation (15) possesses only one (stable) steady-state solution 
if the expression A/as is smaller than a certain threshold value. In 
contrast to Landau’s case, however, our conditions (49) are merely suff- 
cient, but by no means necessary in order that the system (40) should 


Fic. 4 


have only one critical point. Figure 4 illustrates this for VY = 2: The rela- 
tions (49) [which may be interpreted to mean that the slopes of the two 
curves (42) and (43) should nowhere be too large] are clearly not fulfilled, 
and yet the two curves intersect at one point only. 

On the other hand, it is important to construct analytically (i.e., with- 
out having recourse to purely geometrical methods) at least one example 
for which the two solutions m; and M; do not coincide. Call O and P the 
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points of N-dimensional space whose coordinates are (0,, 0; .i. 0). and 
(Th, P;,... I'y) respectively. We shall not develop the argument into a 
rigorous proof, but we can see that if the hypersurfaces 


N 


PAGE fee RGM nn) = 11 — D> oie rn (me) = 0 (i= i, Hs bean p) (50) 


k=1 


are almost coincident with the hyperplanes 7; = 0 in the neighborhood, 
of the point O and with the hyperplanes 7; = I; in the neighborhood of 


Fic. 5 


the point P, the system (40) will have at least two distinct solutions, one 
of them near to O, and one near to P (cf. Figure 5 for V = 2). 

In general, however, we cannot formulate a criterion to insure the exist- 
ence of more than one critical point. We shall therefore postulate in the 
following that, at least for some values of the parameters o; not satisfy- 
ing (49), the system (40) does have more than one solution, not because 
the contrary case is in any way inadmissible, but simply because it pre-. 
sents little interest. 
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We shall also assume that the Jacobian determinant 


afi 


a = Det | bin — on gu (ne) | 
Nk 


Det | 


does not vanish at any one of the critical points, thus excluding singular 
solutions of our system of equations (40). Each critical point will then be 
an isolated point in the space of the variables (m, m2, . .- ny). Since, 
furthermore, all the critical points lie inside the V-dimensional parallele- 
pipedon 0 < 9; < T; (i = 1, 2,...), their number can, in principle, 
be calculated by a formula due to Picard (1925, chap. vii; cf. also Section 
IV below). 

To obtain the solution of the system (34) by the method of successive 
approximations we form the sequence of functions 


t N 
pee (t) = ¢,e %'+ a; emit f Fa wie Oik Vk cr (7) ») dr 
T=0 k=1 


(51) 
Gem LEA De ae ie n=O eee 

with some suitably chosen set of initial functions (¢{"(7), ¢$(7),... 

60 (7)). The ¢$(2)’s need not satisfy the initial conditions ¢{°(0) = c; 


for, if they don’t, the first and all higher approximations will. 
If we choose the ¢{’s in such a way that lim ¢{(#) exists, we have, 


using de l’Hépital’s rule (cf., e.g., Young, 1910): 
Sete fae (0) x 
Hina Sain Jet De evn te (38 (7)) dr = SY cin (ae?) = a1”, 


where we have put 
: (0) (0) 
lim Ke =m (k=1,2,...N). (52) 
We therefore obtain 
: (n) (n) : 
jim & @) =n (@=1,2,...N), (53) 


the {”’s being the terms of the sequence (41) with the 7{’s now given 
by (52). 

For all values of ¢ < to, where éo is arbitrarily large but finite, we can 
show by a standard procedure (cf., e.g., Hadamard, 1930, p. 297, or Bell- 
man, 1953) that—as n> »—the ¢{")(t)’s will converge to a solution ¢; (é) 
of the equations (34), and that the solution thus obtained is unique. 


STUDIES IN IMITATIVE BEHAVIOR 297 


The same argument that led to (36) shows that no matter how we 
choose the initial functions ¢{(r) 


IPO@l<lel+M=G G@=1,2,...N;n>1). (54) 


It follows in particular from (54) that, denoting by G a positive constant 
larger than any one of the quantities G;;: 


oe — OP SeGe Ga 12.68 ave (55) 


Now the sequence (51) can be replaced by the infinite series 
PODS ero ol, (56) 
n=0 


the &th partial sum of (56) being equal to ¢{"(#). 
Using the Lipschitz condition (33) (with B; = 1/S,) we have for all 
values of m > 1: 


et eat OM 


; SA ik) ln) an) (57) 
Saji ftir DF] te (rr) — f(r) Lar. 
Le p> Sk : : 
We put 
N 
R= G=1,2,..:™), (58) 
k=1 YF 


and denote by H a positive constant larger than each of the quantities 
a;R;. Then we find from (57), using (55), 


t 
1 — 6 |S 2G0R: [dr < 2GHt 
T=) 
and hence, by induction: 


* Hi)*-} 
io? — 1 w | S26 Ee, 


Geel Oy ue LV apt 25 


(59) 


Thus the series (56) converges for all finite values of #, since each of its 
terms is smaller or equal in absolute value than the corresponding term 


of an exponential series. a 
Furthermore, by Weierstrass’ well known criterion (see, e.g., Knopp, 
1947, page 355), the convergence is uniform, from which it follows that 


N t ow <heie I 
tim fei SY oven ($4 (7)) de = fe) ointe (lim Sx" (7) dr. 
no J, _¢ 7=0 k=1 


k=1 
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Hence, letting » > © in (51), we see that the limiting functions €(2) 
of the sequence ¢“(#) (i = 1, 2,... NV) will actually form a solution of 
the system (34). Being the limit of a uniformly convergent sequence of 
continuous functions, each of the ¢;(f)’s is itself a continuous function for 
all finite values of ?¢. 

If there existed a second solution, say ¢;(t) ¥ ¢:(¢), we could, by exactly 
the same reasoning as above, establish the relations 


(Ht) 


n!\ 


Roan «0 ee t= 172,..\.. NE e211 oD 


’ 


and since, for fixed #, the right-hand side of (60) tends to zero asu > ~, 
we conclude that ¢/(¢) and ¢,(£) must coincide. 

We again have a particularly simple situation if the relations (49) hold. 
Choose, for instance, the ¢$(#)’s such that lim Oa 0 (sae 
N). From (55) and (57) we derive the inequalities 


[¢)@) — ¢(D| < 2GR(1 — e-*i*) < 2GR 
and, in general, 
1 eet? Ch) — phy 1 < IGR™ te oe a= hg De ee 


R being the largest of the R,’s. 

Assuming that the conditions (49) are satisfied, we can take R < 1; the 
right-hand side of (61) is then the general term of a convergent geometric 
series, and, therefore, (56) will converge uniformly even as > ©. From 
this it follows that 


jim Jim of ®) = Jim jim 6) = m= Mi. (62) 
In general, however, the two double limits of equation (62) are not equal; 


in fact, the first of them may not exist. 
If we take 


(0) —=Ge . 
ti () =ce% G=1,2,...N), lim &:° () =0 (63) 


the functions ¢{"(¢) form a monotonically increasing sequence, ie., we 
have for each fixed value of #: 


fSVOD-—MMOA>O GH hy eke ay ag ; (64) 
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Using (63) we see that inequality (64) is true for » = 0: 
, A t N 
BPO EPO = ase fe Scum (49 ()) dr 20, 
T= 0 k=1 


and thence it can be derived for all higher values of x. 
Similarly, the ¢{"(¢)’s will form a monotonically decreasing sequence 
if we choose 


Bee et — 2 Cob eeei 
(65) 


lim ell C3 eo 


From this it follows immediately as a slight generalization of the result 
obtained in equation (62) that whenever m; = M; (i= 1, 2,...), 
lim ¢:(é) exists and equals M;. In fact, taking the sequence (51) first with 
the initial functions (63) and then with the initial functions (65), we see 
that for sufficiently large values of ¢ 


—e+m:< fit) < Mite, (66) 


where € is an arbitrarily small positive number. 
Consider now the derivatives 


(n+1) N 
— = —a,f;"*” (1) ta ERODE (67) 


Inserting the expression (51) for ¢,{"*?(¢) and integrating by parts, we find 


agate (2) — Beets \ 
dt 


—a;t eri? (n) ae ey 
tae f 3 ene (st ee ies) 


with N 
— >> int, (cx) (ise. Lie 2s aint. SIN oe 
k=1 


Formula (68) holds for all values of x, including » = 0, if the ¢((7)’s are 


such that ¢(9(0) = c (& = 1, 2,...). 
It is clear from (67) that the see miee dé (t)/dt converges to : 


aike 


as n> oo, 


ate) +a> onde (Se (1) = 
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Let us now assume that for a certain set of initial values (c1, ¢2, . . . Cy) 
the conditions 
pata | ik Les; 


N (69) 
C2 > Tix Vk ( Cx) ) 
k=1 


are satisfied for i = 1, 2,...N. Taking the ¢{°(7)’s as given by (63), 
and thus 
dt (4) _ 
dt 
we find by induction from (68) that all the quantities d¢{"(é)/dt are 
<0. The functions d¢;(t)/dt themselves will therefore be non-positive. 
Hence ¢;(¢), being non-increasing and bounded, must tend to a definite 
limit as t— ©. 
A similar argument shows that lim t.(t) ¢ = 1, 2,...N) exists if all 


=— Ge 9 20 (hime 4 Zee Ne (70) 


the z,’s are <0 [we then use the initial functions (65)]. 

If not all the z;’s have the same sign, however, the functions ¢;(#) may 
approach one of the steady states by damped oscillations,* or, as we have 
already noticed, they may not tend to any limit at all. If and whenever they 
exist, these latter undamped oscillatory solutions add a feature of great 
interest to the original theory of Rashevsky and Landau, for if ¢;(#) [or 
\,(¢)] never reaches a steady state, this means that the 7th individual con- 
stantly fluctuates in his tendencies toward one or the other reaction. The 
important question arises whether eventual undamped oscillatory solu- 
tions are stable or unstable against small external perturbations. In 
this paper, however, we shall not discuss this rather difficult problem, but 
limit ourselves instead to investigating the stability of the steady-state 
solutions. 


* This is not possible for NV = 2 (see Section IV). 
} As an illustration we may cite the two-dimensional system of equations, 
T= ye (atty?—1), 
(71) 


dt me) (S* 4-97 = 2)5 


which has only one (unstable) critical point, the origin. But the circle x + y? = 1 is a stable 
trajectory to which all the other trajectories except the point characteristic (0, 0) tend asymp- 
totically. This can easily be seen by introducing polar coordinates. We then obtain from (71): 


dr 


Feat oc asl 
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IV. The stability of the ‘extremal’ critical points. The existence of un- 
stable steady states. 


A. General considerations. If r; (i = 1,2,... N) are the coordinates of 
a critical point, we put 


fi) = r+ e:(é). (72) 


Expanding as indicated in equation (30) and keeping only first-order 
terms, we obtain for the e;(t)’s the “variational” system 


de; (i N 
a8 es — aie; (¢) iB Ok Gk (TR) * €;, (t) , (73) 


We shall denote by B the matrix associated with (73); the elements of 
B are: 


— G; (i= k) 
Bue= — @:6i%.+ @;0:% Qk (7x) = \ (74) 
@;o%% Qn (Tx) (ik) 


and, as already stated in Section III, it will be assumed that at all critical 
points the matrix B is non-singular: 


det | Bu(ri)| #0. (75) 


We refer to Bellman (1953, pp. 76 ff.) for a definition of stability (as ap- 
plied to a steady state). The conditions b) and c) of Bellman’s fundamen- 
tal theorem 1 (Joc. cit., page 79) are obviously fulfilled in our case; in order 
to show that a steady state is stable, it remains therefore to prove that 
all the solutions of (73) tend to zero for ¢— «. This in turn will be the 
case if all the characteristic roots of the matrix B have negative real part. 

Equally important is Bellman’s theorem 3 (page 88) which states that 
the steady-state solution e;(t) = 0 is unstable if the corresponding matrix 
B has at least one root with positive real part. We see, therefore, that for a 
critical point the stability properties of the complete system of equations 
(25) are determined by its linear approximation, i.e., the system (73), 
provided only that (75) holds. 

We shall have to use the following two well known theorems on ma- 
trices: 

Theorem A: If A = (a) is any matrix whose elements may be com- 
plex, and if we put 


P= >> oul; (76) 


kt 
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all the characteristic roots of A (represented as points in a complex 2 
plane) lie within or on the boundary of the ensemble of circumferences 


|z — ais| =e (77) 


which we shall call the limiting circumferences. 

This theorem was discovered independently many times by several 
workers: for references see O. Taussky (1949) or M. Parodi (1952). 

As pointed out, e.g., by S. Gerschgorin (1931) or H. Rohrbach (1931), 
the boundaries for the characteristic values (i.e., the radii of the limiting 
circumferences) can often be improved by subjecting the matrix A to a 
similarity transformation. 

Theorem B:* Let all the elements ay, (i, k = 1, 2,...N) of the ma- 
trix A be real and >0. Assume furthermore that A is indecomposable, ac- 
cording to the definition given by Debreu and Herstein (1953, page 598). 
Then the matrix A has one root, say 1, which is real, positive, single, and 
larger or equal in absolute value than all the other roots of A. (In the 
following we refer to \; as the “largest root.””) Moreover, the linear sys- 
tem of equations 


Onn = 2; (78) 


M: 


k 


1 


(a1, %2, . . . %y being the unknowns) has a non-trivial solution such that 
all the x,’s are positive. 

If A is decomposable, the largest root \1, though still real, can be >0; 
and it may not always be possible to choose all the x,’s in equation (78) 
positive: they can only be made >0. 

Also, if all the a,,’s are >0 we have 


N N 
; tees <= 5 
Min De tS S Max ae (79) 


B. Stability for N = 2. For N = 2 it is clear that the extremal critical 
points represent stable steady states and that stable and unstable solu- 
tions of the system (40) follow each other in alternate order as the num- 
bers 7; increase from m; to M;. 

Except for the factors a; and a, the matrix elements By, and By are 
the slopes of the curves (42) and (43) at the critical points. The condition 
for stability is given by det |B,,| > 0 and can easily be interpreted ge- 


* This theorem was originally given by O. Perron (see, e.g., Perron, 1951, or also Frobenius, 
quoted by Debreu and Herstein). 
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ometrically. Also, the matrix B has only real roots, so that there are no 
oscillatory solutions of the system (25). 

C. The general case. For N > 2 we consider one of the extremal critical 
points, e.g., 7; = M,, and show that all the characteristic roots of the cor- 
responding matrix B must have negative real parts. The argument would 
be completely analogous for 7; = m,. 

Dividing the ith row of B by a; (i = 1, 2,... N), we obtain a matrix 
C with elements 


Cx = —Oi% + ongn(Ms) 5 (80) 


we shall furthermore denote by F the matrix C + E (E being the unit 
matrix): 
Fu = oingu( Mx) = 0 (i 7 k) 
(81) 
Fi = 0 . 


The system of equations (40) can be expanded in the neighborhood of 
the point M;. Putting 


p=n—M; (82) 
we have 


p= Doula Mit pe) — % | 


x (83) 
. 
= Pix pe 
k=1 
Similarly the system of recurrence equations (41) can be written 
pre? = =F lorie (84) 


where the g‘””’s are of the order of magnitude (p6)2. 
For sufficiently large 7 the relations (84) will therefore be equivalent to 


the linear iteration 


7imtD > Voy oie (85) 


or in matrix notation 


pomtl) = Fm) = Arti, (0) | (86) 
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7 being the vector with components (7{”, 7{,...7\") and Fm 
denoting the mth power of the matrix F. 


If lim +”) = 7 exists, it will be a solution of the linear system of equa- 


mo 


tion tr = Fr; because of (75) we must have 7 = 0. 

The iteration (84) converges to zero for arbitrary positive values of 
the quantities p(”, as we saw in Section III (footnote page 292); the 
vector 7) in (85) will therefore converge to zero as m— o for any 
initial set of values 7(% > 0. This means that the mth power of the matrix F 
itself must tend to zero with increasing m, or, in other words, all the char- 
acteristic roots of F must be smaller than one in absolute value (House- 
holder, 1953, page 38; Debreu and Herstein, Joc. cit.). To obtain this re- 
sult we might also have argued as follows: Let A ¥ 1 be the largest root 
of F, and x; > 0 (but with at least one of the x,’s ¥ 0) the solution of 


N 
DS Fan=Ax; G=1,2,...N). (87) 


(The «,’s are determined only up to an arbitrary constant.) If we put 
pe? = me (88) 
and assume A > 1, the leading term of the equations (84) becomes 
py) = Amp +... (89) 


which cannot be true since lim pS” = 0. Hence A must be <1. 


If A = 1 (a case which we have excluded by hypothesis) the linear itera- 
tion (85) may not converge even though the p\”’s do tend to zero. This 
is always the case for V = 2. Because then if A = 1 the matrix F has the 
two roots +1 and —1; it is therefore indecomposable but not primitive, 
and lim F™ does not exist (cf. Debreu and Herstein, Joc. cit., corollary 
page 605). 

Assume now that F is indecomposable; let x; > 0 be a solution of (87) 
and form the diagonal matrix 


= (xe) . (90) 


Then the characteristic roots of F, C, and B are the same, respectively, 
as those of F’ = X-1FX, C’ = X-1CX, and B’ = X1BX, 

Call a’ the smallest and a’ the largest of the quantities a;. Consider the 
matrix G obtained from B’ by adding a’ to each diagonal element of B’. 
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All its elements are >0: 


Gii=a’—a;>0, 
(91) 
G,=B,,2 0 ((#k), 
N 
DIGu= a’ +a;(A—1). (92) 
k=1 


If h > Ois the largest root of G, the root of B’ (or B) with largest real 
part will be 
h'=h- a’. (93) 


Applying the relations (79) to the matrix G we find, if A < 1: 
a'(A—1)<h’ <a"(A—1) <0, (94) 


and hence the steady state r; = M; is stable. 
If F is not indecomposable, its characteristic equation det |F —\E| =0 
can be written in the form 


Ht det |F,—E,| = 0 


where each of the matrices F,, is either indecomposable or a 1 X 1 matrix 
(Debreu and Herstein, Joc. cit., page 600). 

For each F, there is a largest root A, and a matrix B,; if F, and B, are 
indecomposable they can be treated in the same way as the matrices F 
and B above; if F,, is the 1 X 1 matrix with element zero, the correspond- 
ing B, has one of the numbers —a; < 0 as its characteristic root. For the 
extremal steady states the largest of the A,’s is <1, which implies sta- 
bility. 

D. The Poincaré-Bohl Theorem. For each of the critical points we can go 
through an argument similar to the above, forming a matrix B and a cor- 
responding matrix F whose largest root we again denote by A. Then the 
critical point will be stable or unstable according to whether A is <1 or 
>1. For if A > 1 the relations (94) are replaced by 


Om eA 1) = i Sa(k 1). (95) 


Not all the roots of B can have positive real part, since their sum is 
equal to 
N 
= uO, 
i=1 
i.e., even an unstable state possesses a certain conditional stability (see, 


e.g., Bellman, 1953, theorem 4, page 89). But for practical purposes such 
states may be considered as unstable. 
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The product of all the roots of B is equal to the determinant of B. For 
a stable state, therefore, the sign of that determinant will be (—1)”. If, 
on the other hand, the determinant in question has the opposite sign, 
(—1)+1, the corresponding steady state will certainly be unstable: the 
matrix B must have at least one real and positive root, since complex 
roots (if any) occur in pairs. 

Consider now again the system of equations 


N 
fa (M15 I ooo nn) = ee SS cin Ue (me) = 0 @= 15 °2,. 2% N) (96) 
k=1 


whose Jacobian determinant is det |Cix(m, 72, .. - nv)|, and, as before, 
denote by r; (¢ = 1, 2,...N) the solutions of (96). 

In the space of the variables m1, 72, . . . ny let S be a sphere or any other 
similar closed surface. Then, always assuming that det |Cix(r1, 72, - . - ry) | 
~ 0, the index of Kronecker J relative to the surface S is defined as: 


lT=p — aes (97) 


where k’ is the number of solutions of the system (96) lying within S and 
such that det |Cx(r, 72, . . . ry)| > 0, and &” is the number of solutions 
in S for which det |Cx(n, re, ... ry)| < 0. 

The index J is defined in a similar way for any other system of N func- 
tions of the variables 1, 72, .. . ny. 

We shall use a theorem due to Poincaré and Bohl as given by Hada- 
mard (op. cit., Corollary I, page 223). It can be stated as follows: 

The two sets of functions 


Xi(m, Ny, - 2 e nv) 
and 
Xi(m, m2... 00) + Em, m2)... 0v) (= 1,2,...N) 


have the same index of Kronecker J, if on S we have 


N N 
PIP (98) 
t=1 t=1 


We shall take in particular 


XxX; (nm, N2,- + nN) a is (99) 


N 
£3 (m1, m2)... nw) = >, Tie Ve (mE) STi; (100) 
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and for S a sphere around the origin sufficiently large to contain all the 
points 7; and to insure that the relations (98) are satisfied. 

Now the index J of the functions (99) is obviously equal to (—1)¥, and 
the same must therefore be true of the system (96). 

Thus our differential equations have, in general, an odd number of 
critical points, say 2s + 1, of which at least s correspond to unstable 
steady states (s = k’’ if N even, s = k’ if N odd). 

From the Poincaré-Bohl theorem it also follows that the conditions (49) 
which we had found to be sufficient for the existence of only one critical 
point, are a special case of the following proposition: Whenever the con- 
stants o;, and the functions g,({;) are such that the determinant 


det | — 6+ onge( Cx) | 


does not vanish for any set of values of the variables ¢; (i = 1,2,... NV), 
there exists only one (stable) steady state. For then we have either k’ = 0 
or k” = 0 but always k’ — k” = (—1)%. 

E. Oscillatory approach to a steady state. At any one of the critical points 
it is quite possible that the matrix B might have one or more pairs of com- 
plex roots. If that is the case for a stable critical point, some of the £,(¢)’s 
may reach the steady state by damped oscillations which will be practical- 
ly important (i.e., observable) only if the real part of the root (damping) 
is not too large in absolute value. Let \ = —yu + iv (u > 0) be a com- 
plex root of the matrix B, and assume A < 1. Any criterion for “strong 
damping” is, of course, somewhat arbitrary; we may for instance agree to 


call the damping strong if 
|| 


—<2r 6103) 
iv 


for then the amplitude of oscillation will decrease to (1/e)th of its value 


in one period. 
Each of the roots of B lies within at least one of the limiting circum- 


ferences K;, whose center is at the point (—a,, 0), and whose radius is 
equal to a,A. Therefore (Fig. 6): 


ly) Sas, 
Eel — Se 
from which it follows that 
ito 


V 
im 1—AC 
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Hence if at a stable state in addition to A < 1 the stronger condition 
A/(1— <A) < 2m or 


2a 

A< Tae 3 (102) 
is satisfied, the relation (101) will hold for any of the complex roots, and 
the oscillations will be practically unobservable. Inequality (102), though 
very useful, is clearly a sufficient but certainly not a necessary condition 
for strong damping. Any more precise indication on this subject could 
only be obtained by actually estimating or computing the roots of B in 
each single case. 


Fic. 6 


V. Concluding remarks. It is not the purpose of this paper to deal with 
the difficult problems involved in the practical determination of the co- 
efficients a{,, 8;,, and a; introduced in Section I. Our aim was primarily to 
investigate the mathematical properties of the model, since it will be re- 
membered that Rashevsky (op. cit., chap. xii) writes: ‘“We shall consider 
the simple case in which the constants A and a are the same for all indi- 
viduals. . . . The more general case leads to great mathematical difficul- 
ties.” It seems appropriate to introduce the coefficients of interaction as 
a way of describing an actually existing situation; however, it is clear that 
their values can at best be determined very roughly for small groups under 
special experimental conditions. 

The equations which we have set up, besides being subjected to the 
limitations mentioned in Section I, are still incomplete in many respects. 
In particular, a more complete theory would take into account the inter- 
action between imitative and motivational behavior, as was done by 
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Rashevsky in the latter chapters of his book. One point which deserves 
special attention and which is particularly emphasized by N. E. Miller 
and J. Dollard (1941) is that in general no imitation will occur unless the 
individual has some motive (drive) to perform one of the reactions in 
question, and that imitation, like learning, will not occur if no reward ac- 
companies it; or, if it does occur under conditions of no reward, it does not 
acquire permanent status but is likely to undergo “extinction,” again like 
learning. In contrast to this V. Pareto (1920, page 183, No. 420) dis- 
tinguishes between those cases in which imitative behavior clearly serves 
to satisfy a useful purpose, as against the ‘“‘non-logical”’ cases in which no 
such usefulness is apparent. In this connection, however, it will be well to 
remember the very broad interpretation given by Miller and Dollard to 
the word “reward” (oP. cit., page 29). In any event, it is obvious that the 
concepts of incentive, utility, and reward play a very important role 
whenever imitation is involved. The close connection between imitation 
and learning—which is the main contention of Miller and Dollard’s book 
—is clearly brought out by E. Faris (1937, chap. vii, pp. 73-83); he dis- 
tinguishes three types of imitative behavior, one of which (called by him 
“copying’’) involves the learning of new reactions by imitating a model 
(learning by imitation). In other situations the reactions which are imi- 
tated already belong to the repertoire of actions which the imitator is 
capable of performing. Clearly, our theory would apply rather to this lat- 
ter type of imitation, although it could certainly be extended to include 
copying (in the sense of Faris) as well; we may in fact recall that a satis- 
factory theory of learning can be based essentially on the same neural 
mechanism which underlies Rashevsky’s theory of imitation (cf. also Ra- 
shevsky, op. cit., chap. xxii). Again, all possible gradations in the process 
of imitation itself may occur: from almost exact repetition of a simple 
action to the much vaguer ‘‘transmission of the idea,” or, as Kroeber 
(1952, article 44, pp. 344-57) calls it, “stimulus diffusion.” 

In our model the factors of reward and drive are partially included in 
the quantities @;9, since if an action is constantly rewarded (or punished) 
the individual’s tendency to perform that action will increase (or be sup- 
pressed). We may, moreover, reinterpret the meaning of our constants 
a;, (and @;,) assuming them to include a term which should depend only 
on the index i, not on &, and be a measure for the reward associated with 
the performance of reaction A (or B). This extension of the theory would 
not be too different from that given by Rashevsky (0. cit., chap. xxii) in 
discussing possible interactions between motivational and imitative be- 


havior. 
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If each individual has a choice of more than two reactions, we could, in 
principle, introduce a set of quantities 4{’? such that each of the B"")’s 
should specify the tendency toward performing the 7th reaction; we might 
postulate, e.g., that the 7th reaction should or should not occur depending 
on whether the quantity ¢{” is >0 or <0. Associated with each &{” 
would bea distribution function p;(“”) and a quantity y$” (d) ; the effects 
of imitation would then be described by a set of coefficients similar to the 
a,,’s and B;,’s. The difficulty lies, of course, in the fact that if some of the 
reactions are mutually exclusive, the corresponding ®{"”’s could not be 
simultaneously >0, and therefore supplementary conditions should be 
introduced which would considerably complicate the mathematical treat- 
ment. Rashevsky’s (1950a) attempt to solve the case of three reactions 
(or even more than three) cannot be considered quite satisfactory, and 
therefore the problem remains still open. 

Examples in which only two choices are possible may be found, e.g., 
(at least approximately) in some aspects of feminine fashion (as: long vs. 
short gown) or in political elections involving candidates from only two 
parties. In the latter case some of the coefficients aj’ and Bi’ may be as- 
sumed to describe several methods of persuasion (including perhaps 
bribes and threats) besides the purely imitative factors. 

An interesting quantitative study on the variations of feminine dresses 
over a long period of time (approximately 300 years) has been made by 
A. L. Kroeber (0. cit., No. 42, pp. 332-36, and No. 45, pp. 358-72). He 
finds that the basic elements of style (as opposed to the more voluble and 
variable patterns of mode) show a regular alternation between extreme 
values with a half-period of the order of magnitude of 50 years. Annual 
changes are relatively small if compared to the main “secular” variations. 
Kroeber suggests as an explanation for these findings that “‘there exists a 
basic pattern of women’s dress style toward which European culture of 
recent centuries has been tending as an ideal,” and that “sociocultural 
stress and unsettlement”’ (like periods of war and social unrest) ‘exert 
their influence upon an existing stylistic pattern which they dislocate or 
invert” thus bringing about the opposite extreme. We have not sufficient 
information to state whether Kroeber’s data might equally well be ex- 
plained by a theory along the lines of Rashevsky’s, although the possi- 
bility certainly cannot be discarded. It is worth while noting that Kroeber 
(see end of his article 45) deliberately avoids any explanation in terms of 
psychological factors, such as imitation, emulation, or competition, since, 
he says, they depend on quantities which are unmeasurable and undefin- 
able. The whole work of Rashevsky constitutes an attempt to make just 
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such factors amenable to quantitative treatment, and it is to be hoped 
that this goal will eventually be reached, thus providing an incomparably 
better understanding of the phenomena in question (cf. also Chapple, 
1940). 

We now return to our equations, comparing them to those of Rashevsky 
and Landau. We have already mentioned the importance of eventual os- 
cillatory solutions. If we limit this discussion to the possibility of transi- 
tion from one steady state to another, we see that qualitatively the situa- 
tion is not essentially different from that of Rashevsky and Landau’s sim- 
pler theory. For if originally our relations (49) are fulfilled and therefore 
only one stationary state exists, the system may, under certain circum- 
stances, perform a transition to either one of two extremal states (which 
one being a matter depending on chance or on other factors not accounted 
for in our treatment), if the coefficients of interaction are sufficiently in- 
creased. In general these new states are such that the average number of 
individuals performing one of the reactions will be larger than it was in 
the original steady state. The actual magnitude of the numbers X and Y, 
however, depends critically on the distribution of the quantities 9 among 
the population and on the magnitude of the coefficients aj, or, more pre- 
cisely, on the ratios 


ee eet Get, 0 


/ , 
a Vix Da Bix 
ik k 


It may, therefore, be possible, according to this model, that even a strong 
increase in, say, the a;,’s will bring about only a moderate “jump” in the 
tendencies of the whole group. This would be the case, e.g., if the popula- 
tion were very strongly biased in favor of reaction B (most of the ®j’s 
negative and very large). 
If we have* 
a, = By = A (for all 7’s and k’s) 


and if the @;.’s and the functions p,(¢/) are such that 
N 
N 
i Ux (Bro) = on 
k=1 


* These and the following relations are of course mathematical idealizations. They will 


never be satisfied exactly. 

t This will be the case in particular if the pi(#;)’s are even functions (pili) = bi(— ¢$;)) 
and the ¢io’s are symmetrically distributed among the population (i.e., for each individual 
with a given value of ¢io there is one with the value —¢io). 
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then y,(t) = 0 (i = 1, 2,... NW) will bea solution of the equations (16) 
corresponding to a state in which the population is equally divided in its 
tendencies toward reactions A and B. 

If furthermore all the ¢,0’s are not too different from zero and the func- 
tions p,(¢,) are ~ 0 for 


i< iw, 

it Sa j '>-—¢.,,.+—>_ 8B. 

o; < Pio Ties iy. and for ?; big roe tk 
(Fig. 7), then we have the situation discussed in Section ITI, i.e., the sys- 
tem (16) has at least three critical points; moreover the ‘‘extremal” steady 


Pg;) 


~Vo= ae ~ Gin “Vist ohn 4% 
Fic. 7 
states are such that practically the entire population shows either one or 
the other type of behavior. 
On the other hand, if all the p,(¢;)’s are even functions (a very plausible 


assumption) but the ¢io’s are clustered around a very large positive value 
(strong bias in favor of reaction A) such that 


/ / 1 ~ / 
p(¢)) +0 for <—d,,+—D> 8, (103) 
tk=1 


(Fig. 8), we have 
v(ni)~1 for O<an<Ti, 


and hence the system (40) will have only one solution, expressing the fact 
that the great majority of individuals will perform reaction A, as would, 
of course, be expected. This result does not contradict Landau’s (loc. cit., 
pp. 228-29) statement: “in a sufficiently large population it is possible to 
have a stable condition in which the majority behaves contrary to its own 


desire because of the effect of imitation.” For if N becomes very large 
the sum 


N 
/ 
pay 


k=1 
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will also increase, so that the relations (103) will no longer be satisfied. 

Many different and more complicated situations can arise; it is particu- 
larly two classes of cases which should provide interesting applications of 
the theory and essentially new results, since they are not covered by 
the simpler Rashevsky-Landau model in its original form: namely those 
in which the number N is small* (including the interaction between two 
individuals only), and those in which the differences between the various 
coefficients a,, are very pronounced. As a greatly oversimplified example 
of the latter case, assume that one individual, e.g. the jth, influences all 
the others very strongly toward performing reaction A, while another, 
say the /th, exerts a comparatively strong influence toward reaction B. 


Vio a EX ~ Pio gee 2 Bik gi 
Fic. 8 


We shall assume that all the a’,’s (k #7) and all the 6/,’s (k #1) can be 
neglected as compared to the aj,’s and the 6;,’s (i = 1, 2,... N). Then, 
if all the a,’s are of comparable magnitude and if the number J is not 
too large, the system (40) reduces approximately to 


. . 
n= 2 0 (n) +E y(n) G=1, 2, ..-N5 iG, (104) 
a; i 


, 


B! 
n= v(m) , (105) 
a; 


ee 
nr —2 04 (15) « (106) 
a} 


Equations (105) and (106) represent the interaction between the two 
individuals 7 and J; they may be solved graphically for uD and m1, where- 
upon the remaining 7,’s can be computed from (104). It is reasonable to 


* For small V the numbers X and Y as given by equations (10) and (11) lose much of their 
significance, but the quantities 4; (¢) (or their steady-state values) can be used directly for each 
individual. : 
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assume that the jth individual will be strongly biased in favor of reac- 
tion A (large positive ¢jo), whereas the /th individual will show a strong 
bias toward reaction B; if these ¢;9 and di (<0) are very large, the two 
leaders will not influence each other appreciably and a sizable fraction of 
the population will follow each of them, thus forming two parties. If con- 
ditions are less extreme, one of the two influential individuals, say /, might 
finally be induced to show preponderantly the behavior of the other, in 
which case his coefficients of influence are very likely either to become in- 
significant or even to act in the same direction as those of /, i.e., toward 
reaction A, thus strengthening the tendency of the whole group to act in 
the same way as the jth individual.* 

Of greater interest are cases in which a population is split into several 
groups either by geographical, social, or other types of barriers, there be- 
ing a certain amount of imitative interaction within as well as between 
groups. If the latter is very small (practically zero), the matrix a, can 
be partitioned into submatrices along the main diagonal, with one such 
submatrix for each group. If, for simplicity, we assume conditions similar 
to those of the Rashevsky-Landau theory to hold within each group, it is 
obviously possible that some of the groups will perform predominantly 
reaction A and others predominantly reaction B (still others showing no 
definite preference toward either reaction). Such situations are fairly fre- 
quently met with. As a rather trivial example involving only two large 
groups separated by a geographical barrier, we may mention the fact 
that in the United States it is common practice to eat meat (after having 
cut it) holding the fork with the right hand, whereas in Western Europe 
this is usually done with the left hand. There seems to be, in this case, no 
particular advantage or reward which would tend to favor one of the two 
behaviors, and clearly the interaction between the average American and 
the average European is too small to bring about any decisive change in 
this insignificant detail of everyday life. 

Things might change drastically, however, if the coefficients of influence 
between groups were sufficiently increased. Then, as we have seen, pro- 
vided plausible assumptions are made about the functions pi($;) and pro- 
vided also that the average ¢io’s of the groups are not extremely large in 
absolute value, a steady state will be reached, for which the great majority 
of the total population shows one type of behavior in preference to the 
opposite type. We have then what has been called by Rashevsky (1953) 
the case of “extreme coupling.” 

The tendency to uniformity of behavior for large sections of a popula- 


* On the problem of leadership see also Rashevsky (1950b). 
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tion is widely observed in our times of easy and rapid communication and 
exchange of ideas; our theory might provide a general background for a 
quantitative interpretation of these phenomena. 

All the details of the transition can only be obtained by actually solving 
the system of equations (40) which, as we have already mentioned, would 
imply a great amount of experimental work to obtain some estimate of 
the quantities 0, and ¢.0, a task, though, that can be accomplished in 
principle, e.g., along the lines suggested by Chapple (Joc. cit.). Pertinent 
information obtained from our equations would include not only the num- 
ber of conformists and nonconformists in the total population, but also 
their distribution within each group; furthermore the role played by the 
size of the various groups in relation to the strength of the influence ex- 
erted by the rest of the population. Cases of small groups stubbornly re- 
fusing to adopt the behavior of a vast majority surrounding them are 
sometimes observed. We may cite as examples some small communities in 
Switzerland which speak German although they are completely surround- 
ed by an Italian speaking population, or some Protestant valleys in North- 
ern Italy—an overwhelmingly Catholic country. We should clearly have 
to assume in these cases that the minority groups have extremely high 
values of their ®;9’s. Some caution would, however, be necessary in apply- 
ing our theory, since, e.g., the changing of religion certainly involves 
other factors besides imitation (or, more generally, influence from other 
people). 

Still other situations that can be envisaged would be those in which the 
mutual influence increases only between some of the groups, or the case 
of a certain pattern of behavior spreading from one population to the 
other not directly but by the intermediary of a third group. We see from 
these brief indications that our equations—though still very incomplete 
and fragmentary—might be used to describe mathematically some aspects 
of the spreading of ideas and the diffusion of culture in which imitative 
behavior (more precisely copying) plays a fundamental role, as described 
in great detail by Miller and Dollard (Joc. cit.). 
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Mathematical biology has hitherto emphasized the quantitative, metric aspects of the phys- 
ical manifestations of life, but has neglected the relational or positional aspects, which are of 
paramount importance in biology. Although, for example, the processes of locomotion, inges- 
tion, and digestion in a human are much more complex than in a protozoan, the general rela- 
tions between these processes are the same in all organisms. To a set of very complicated diges- 
tive functions of a higher animal there correspond a few simple functions in a protozoan. In 
other words, the more complicated processes in higher organisms can be mapped on the simpler 
corresponding processes in the lower ones. If any scientific study of this aspect of biology is to 
be possible at all, there must exist some regularity in such mappings. We are, therefore, led to 
the following principle: If the relations between various biological functions of an organism are 
represented geometrically in an appropriate topological space or by an appropriate topological 
complex, then the spaces or complexés representing different organisms must be obtainable by 
a proper transformation from one or very few primordial spaces or complexes. 

The appropriate representation of the relations between the different biological functions 
of an organism appears to be a one-dimensional complex, or graph, which represents the “or- 
ganization chart” of the organism. The problem then is to find a proper transformation which 
derives from this graph the graphs of all possible higher organisms. Both a primordial graph 
and a transformation are suggested and discussed. Theorems are derived which show that the 
basic principle of mapping and the transformation have a predictive value and are verifiable 
experimentally. 

These considerations are extended to relations within animal and human societies and thus 
indicate the reason for the similarities between some aspects of societies and organisms. 

It is finally suggested that the relation between physics and biology may lie on a different 
plane from the one hitherto considered. While physical phenomena are the manifestations of 
the metric properties of the four-dimensional universe, biological phenomena may perhaps re- 
flect some local topological properties of that universe. 


It is not the purpose of this paper, which is intended to be the first of a 
series, to present a theory of any particular biological phenomenon in a 
more or less completed form. It is proposed to examine critically the de- 
velopment of mathematical biology, to point out some of its cardinal de- 
ficiencies, and to indicate entirely new possible directions in which further 
success is likely to lie. 
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Although in the last two decades mathematical biology has achieved 
considerable success in many directions, in others, which are of prime im- 
portance to the biologist, it has been lacking in any accomplishment. 

On the credit side, mathematical theories of a large number of separate 
biological phenomena have been developed very successfully. Not only 
have these theories in all cases represented, sometimes with very great ac- 
curacy, quantitatively known facts, but in a number of cases these theo- 
ries have led to an accurate prediction of the quantitative results of new 
experiments, not previously performed. Thus the theory of cell division, 
based on the theory of the diffusion drag forces developed originally by 
N. Rashevsky (1939, 1948), has led to H. D. Landahl’s equations for the 
rates of elongation and constriction of dividing demembranated Arbacia 
eggs (Landahl, 1942a, b, c, d, e; Rashevsky, 1948), and those equations 
were subsequently verified with amazing accuracy by R. Buchsbaum and 
R. Williamson (1942). Landahl’s (1942c) theory of the return of the arti- 
ficially deformed Arbacia eggs to their original shape agrees very accu- 
rately with data already available of N. Harvey and H. Shapiro (1941). 
The earlier calculations by Rashevsky (1938) of the average size of cells 
and of the variation of cell size with rate of metabolism also agree, though 
with much less accuracy, with available data. However, as has been re- 
cently shown by Rashevsky (1952), the diffusion drag forces theory is def- 
initely inadequate to represent the other known facts of cell division. 
The role of the diffusion drag forces in cell biology is nevertheless of para- 
mount importance, as evidenced by the work of J. Z. Hearon (1949a, b; 
1950a, b, c). 

The mathematical theory of circulation in the hands of G. Karreman 
(1952) and others has led to very interesting and fruitful results. The 
theories of blood-tissue exchanges as developed by M. Morales and R. 
Smith (1944, 1945a, b) and by G. Schmidt (1953) have also led either to 
agreements with available data or to suggestions of new experimental 
procedures. 

In the field of peripheral nerve excitation and conduction, the theoreti- 
cal developments of H. A. Blair (1932a, b), A. V. Hill (1936), George Kar- 
reman (1951), H. D. Landahl (1941c), and N. Rashevsky (1948), to men- 
tion only a few, have led to quantitative agreements between theory and 
experiment for phenomena of excitation by electric currents, for effects 
of different ions, etc. 

Perhaps the most noticeable results were obtained in the theory of the 
central nervous system, in spite of the fact that its complexity seemed to 
preclude the success of any theory. The agreements between experiments 
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and theory in phenomena of reaction times and psychophysical judgment 
(Landahl, 1938, 1939), as well as for discrimination of intensities (House- 
holder, 1939, 1940), would in some cases leave even a theoretical physicist 
satisfied. Landahl’s (1941a, b) theory of learning has led both to a quanti- 
tative agreement with available data and to a prediction of the quanti- 
tative outcome of new experiments with an accuracy that leaves nothing 
to be desired. Of a similar nature is Rashevsky’s prediction of the results 
of the quantitative test of aesthetic preferences of certain geometrical 
forms (Rashevsky, 1948; Rashevsky and Brown, 1944), though the accu- 
racy, while possibly significant, is far less striking than in the cases men- 
tioned above. 

The theory of such complex phenomena as gestalt transposition has 
been developed to a considerable extent (Rashevsky, 1934, 1938a, 1948; 
Culbertson, 1950; Pitts and McCulloch, 1947). Even such phenomena as 
abstract thinking, rational learning, and various mental disturbances 
have been shown to be amenable to mathematical treatment (Rashevsky, 
1948). 

As a somewhat separate branch of mathematical biology stands mathe- 
matical genetics (Haldane 1936; Hogben 1946; Wright, 1931, 1938a, b, 
1945, 1946, 1949, 1950, 1951), a well developed science, the results of 
which are of basic importance to the experimental geneticist. 

Against this array of successes must be weighed the following short- 
comings. First, there are still a large number of biological phenomena to 
which the attention of the mathematical biologist has not been turned. 
While serious, this is the least important shortcoming—time may easily 
remedy it. A very serious shortcoming is this: All the theories mentioned 
above deal with separate biological phenomena. There is no record of a 
successful mathematical theory which would treat the integrated activities 
of the organism as a whole. It is important to know that diffusion drag 
forces may produce cell division. It is important to know how pressure 
waves are reflected in blood vessels. It is important to have a mathemati- 
cal theory of complicated neural networks. But nothing so far in those 
theories indicates that the proper functioning of arteries and veins is essen- 
tial for the normal course of the intracellular processes; nor does anything 
in those theories indicate that a complex phenomenon in the central nerv- 
ous system, by eventually resulting, for example, in the location of food, 
becomes very indirectly, yet intimately, tied up with some metabolic 
process of other cells of the organism. Nothing in those theories gives any 
inkling of a possible connection between a faulty response of a neural net, 
which leads to the accidental cutting of a finger, and the cell divisions, 
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which thus result from a stimulation of the process of healing. And yet 
this integrated activity of the organism is probably the most essential 
manifestation of life. 

So far as the theories mentioned above are concerned, we may just as 
well treat, in fact do treat, the effects of diffusion drag forces as a peculiar 
diffusion problem in a rather specialized physical system, and we do treat 
the problems of circulation as special hydrodynamical problems. The 
fundamental manifestations of life mentioned above drop out from all 
our theories in mathematical biology. 

Against this criticism, one may object that this is also a matter of time. 
When the physicochemical dynamics of a cell are worked out, the dynam- 
ics of interaction of cells, and thus the dynamics of cellular aggregates, 
will become possible. This will eventually lead to the theory of the or- 
ganism. Two different attempts in that direction have been made by 
Rashevsky (1938a; 1943a, b, c; 1944; 1948). Both are, however, failures in 
spite of some possibly interesting mechanisms suggested in them. These 
failures do not mean, of course, that a successful attempt may not actu- 
ally be made. Let us, however, appraise the problem realistically. In 
celestial mechanics, where we deal with forces varying as simply as the 
inverse square of the distance and acting on rigid masses, the three-body 
problem, let alone the m-body problem, still defies in its generality the 
ingenuity of mathematicians. The forces between cells are much more 
complex; they are non-conservative, and the cells themselves are not 
merely displaced but also changed externally and internally by these 
forces. What are the chances within a foreseeable number of generations 
to even approximately master the problem of an organism as an aggre- 
gate of cells, considering that this organism consists of some 10" cells, 
hundreds of different tissues, and thousands of complex interrelated 
structures. Pessimism is not a healthy thing in science, but neither is 
unrealistic optimism. 

But if we abandon the hope of developing a mathematical theory of the 
organism starting at the cellular level, does it mean abandoning the hope 
of developing a physicomathematical theory of the organism and, there- 
fore, of life at all? We shall see below that this is not the case. 

We may look at the whole situation from a different angle. If we may 
for a moment indulge in comparisons (which, in general, is a risky proce- 
dure), we must say that in spite of the sometimes rather elaborate nature 
of some of the theories, mathematical biology is now in a pre-Newtonian 
stage of development. In pre-Newtonian physics there existed, since the 
days of Archimedes, mathematical treatments of some separate physical 
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phenomena. But it was Newton, who, in his laws of motion, introduced 
the first general principles which rapidly were applied to practically all 
domains of physics. In present day mathematical biology we use a great 
number of more or less ingenious mathematical models, based on sound 
physical principles. The use of models is well known to the theoretical 
physicist. But in physics these models are based on the basic principles. 
Newton’s laws were used in 19th century models, like that of the billiard 
ball molecule of the kinetic theory of gases. Einstein’s principles are used 
in contemporary models. A mathematical model is of a transient nature, 
while the general principle, once found, enjoys greater permanency. The 
supplanting of the Newtonian mechanics by general relativity in no way 
discards the former. Newton’s laws of motion are found to be approxima- 
tions, frequently very close approximations to the relativistic laws of mo- 
tion. 

As we have seen, a direct application of the physical principles, used in 
the mathematical models of biological phenomena, for the purpose of 
building a theory of life as an aggregate of individual cells is not likely to 
be fruitful. We must look for a principle which connects the different 
physical phenomena involved and expresses the biological unity of the 
organism and of the organic world as a whole. 

Alfred J. Lotka (1922a, b) has suggested as a general physical principle, 
which may characterize the organic phenomena, the principle of maxi- 
mum flow of energy. According to it, the physical configurations of the or- 
ganism and of the organic world as a whole are determined by the re- 
quirement of maximizing the rates of energy exchanges within the organ- 
isms and between them. Lotka has not elaborated the principle mathe- 
matically. Rashevsky (1948) has attempted this for the organic world as 
a whole, as yet without any success, though interesting possibilities for 
the theory of evolution arise from this application. Still the fundamental 
integration of the organism and of the organic world does not seem to 
follow immediately and naturally from this principle. 

In connection with the theory of organic form, Rashevsky (1943a, b, c; 
1944; 1948) proposed the principle of maximum simplicity as determining 
the sizes of the different parts of an organism and thus its gross shape. 
The principle states that given the intensities of some biological functions, 
for example, the speed of locomotion, the body weight to be supported 
by extremities, or the amount of oxygen to be supplied to the organism, 
the structure of the organism is determined as the simplest possible physi- 
cal design which can perform the necessary function with the required in- 
tensity. In this way the length of extremities, the relative shapes of trunks 
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of quadrupeds, the size of the heart and lungs, and the frequency of heart 
beats and respiration have been calculated, with a general, but not too 
good, agreement with observed data. The application of this approach to 
the structure of the vascular bed by David Cohn (1953) has led to better 
agreement between calculated and observed values, and the principle 
should be considered definitely promising. But even if it should prove 
useful in the future, this principle cannot claim the necessary generality 
which we seek. It leaves the problem of integration practically untouched. 
It is likely to lead to the understanding of the mechanical integration of 
an organism as a locomotor unit, or to the physical integration as a unit 
in which distribution of food and oxygen are concerned, but the problem 
of the general integration of entirely diverse functions seems to be beyond 
its scope. 

There is still a third angle from which we may view our problem and 
which will indicate to us the way to its solution. Thus far mathematical 
biology has emphasized almost exclusively the metric aspects of biological 
phenomena. In the study of physical, as well as of biological, phenomena 
the quantitative aspects are all-important. Every physicist and mathe- 
matical biologist knows that a qualitative statement or prediction is usu- 
ally of very limited value and frequently is meaningless. Without finding 
a quantitative expression for the forces which may produce cell division 
and without comparing them quantitatively with forces that are neces- 
sary to divide a cell, no meaningful prediction can be made. When we ob- 
serve the phenomena of biological integration we notice, however, not 
quantities, varying continuously or discontinuously, but certain rather 
complex relations. In two of its closely related branches, topology and 
theory of groups, mathematics has excellent tools to deal with such rela- 
tions. The application of topology to psychology was advocated long ago 
by Kurt Lewin (1936, 1951), without, however, using any of the actual 
mathematical apparatus of topology. Implicitly topological relations are 
contained in even older considerations of I)’Arcy W. Thompson (1917). 
Thompson’s observations that skulls or other skeletal parts of different 
animals can be “transformed” into each other by simple continuous uni- 
valent transformations amount, in essence, to the statement that the dif- 
ferent shapes of skulls are not only homeomorph but even homotop. 

These are, however, relatively superficial relations. Topological analo- 
gies go much deeper in the realm of the living when we observe not merely 
structural but functional (in a biological sense) relations. The unity of the 
organism and the unity of all life is expressed by just that kind of rela- 
tions. A paramecium performs some relatively simple movements which 
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either bring it in contact with food or with another paramecium, with 
which it conjugates. These relatively simple movements are produced as 
responses to simple chemical stimuli. After coming in contact with food, 
the paramecium ingests and then digests it, excreting undigestible waste. 
It reacts to light and avoids harmful stimuli. The relatively simple re- 
sponses to simple stimuli result in movements which serve either the 
preservation of the individual or that of the species. 

A bird flying after food or after its mate performs much more complex 
movements, which are responses to much more complex sets of stimuli. It 
also performs very complex movements with its larynx which result in 
the production of sounds. The latter have a definite survival value both 
for the individual and the species. Thus again we have a set of movements 
as a response to a set of stimuli, which result in either obtaining food, a 
mate, or avoiding harmful situations. When food is obtained, it is ingested 
and digested by a series of much more complex mechanisms than those in 
a paramecium. But the general pattern is the same. 

A human being performs still more complex movements, serving either 
the survival of the individual or of the species. Considering the existing 
evidence (Jacobson, 1930a, b, 1931) that thought is but a covert speech, 
we notice that the thinking of a scientist or inventor is another very com- 
plex form of minute muscular movements, which much more indirectly 
than in the case of a bird or paramecium contribute to the survival of the 
individual or of the species. And the composition of a love sonnet by a 
poet and its writing down are again another highly complex set of covert 
and overt motions which may result in the finding of a mate and which 
correspond to the much simpler movements of a paramecium that produce 
the same result. The purely vegetative functions of ingestion and digestion 
of food, and other connected phenomena, are very much more complicated 
in the human being, but again follow the same general pattern as that in 
a protozoan. 

Now when we consider the sessile plants, we still find fundamentally 
the same patterns of relations, which are more complex than in a unicellu- 
lar organism but much less complex than in man. Although it is custom- 
ary to say that autotrophic plants manufacture their own food, it is per- 
fectly logical to consider the solar energy and carbon dioxide as the primi- 
tive food, taken from outside. Phenomena of phototropism involve entire- 
ly different mechanisms from those found in the movements of a parame- 
cium or higher animals, but these movements again occur in response to a 
stimulus and result in a better ‘‘contact” with food—the radiant energy 


of the sun. 
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These relations go even further, and many features of animal and, es- 
pecially, human societies show similar relations. This has led to frequent 
considerations of society as a kind of organism. Just as numerous biologi- 
cal functions of a simple cell eventually specialize and are divided between 
many cells of a metazoan, so are different functions of a human being 
specialized and divided between many individuals in a society. 

What we described in the preceding paragraphs is generally known. In 
fact, it is so well known as to be apt to be overlooked. Yet is it this corre- 
spondence that is the essential feature of the organic world and that con- 
stitutes the unity of everything living. 

Speaking in mathematical terms, the situation just described does not 
remind us of homeomorphism or of one-one isomorphism. It rather re- 
minds us of many-to-one isomorphism [mehrstufiger Isomorphismus 
(Reidemeister, 1933, p. 20)], or of homomorphism of groups. To a simple 
physiological function of a protozoan corresponds a large set of similar 
physiological functions in a higher animal. The various complex biophysi- 
cal and biochemical phenomena involved in the act of vision of man corre- 
spond to a simple light sensitivity of the protoplasm of a protozoan. Simi- 
larly, the very large number of complex muscular movements in human 
locomotion corresponds to a relatively simple movement of the protozoan. 
But the relation between the simple light sensitivity process in a protozoan 
and its avoiding motion (Buchsbaum, 1938, p. 20) and the relation be- 
tween the sight of a tiger and the running motion of the man is basically 
the same. 

Merely observing and expressing the above does not help toward its 
understanding. Nor does a mere description in mathematical terms help 
any. But such a description leads us to the search of some regularities in 
the observed phenomena, and the finding of any such regularities may be 
important. 

Situations like the one just discussed are well known in topology. Map- 
pings of one topological space upon another, or of one complex upon an- 
other in such a way that to each point of one space (complex) there corre- 
spond several, eventually an infinite number, of points of the other are 
well known (Lefschetz, 1930, 1949; Seifert and Threlfall, 1934). If we can 
in some way represent the different biological relations in terms of a topo- 
logical space or of a complex, then the fundamental property of life, stated 
in mathematical terms, is this: The topological space or complex, which 
corresponds to any organism, can be mapped upon that of some lower or- 
ganism in such a way that to each point of the space or complex of the 
lower organism correspond many points of the space or complex of the 
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higher. But this is merely a restatement in topological terms of a generally 
known fact, and as such is of very limited use. 

Any mapping in topology is, however, represented by some transforma- 
tion. If we seek for any regularities in the mapping observed in the living, 
regularities which have predictive value, we are of necessity led to the 
following principle: 

The topological spaces or complexes by which different organisms are 
represented are all obtained from one or at most from a few primordial 
spaces or complexes by the same transformation, which contains one or 
more parameters, to different values of which correspond different or- 
ganisms. 

It must be emphasized that there is nothing hypothetical about this 
principle, which, for brevity, we may call the principle of bio-topological 
mapping. If we do not accept it, we admit the absence of any regularity 
in the different mappings which we know to exist. But then the scientist 
has nothing to do. The principle states a known fact coupled with our basic 
belief in the uniformity of nature, without which no science can exist. 

To make any use of the above principle, we must first of all find a proper 
geometrical representation of the relations characteristic of an organism. 
The word “organism” itself provides us with a clue. It derives from the 
same root as “organization.” For organizations in human societies there 
is a standard, well adapted method of geometric representation: the or- 
ganization diagram or chart. Topologically it is a one-dimensional com- 
plex or graph. Moreover it is a directed or oriented graph (Konig, 1936). 
The relations between the different functions of an organism can also be 
represented by a graph, and such graphs are not infrequently used in 
. physiology. 

Unfortunately, we do not know enough even about the simplest proto- 
zoa to describe their graphs in any detail. From the knowledge we have 
we can, however, make some plausible surmises. Let us see what the graph 
of a relatively simple organism may look like. 

It should first be very strongly emphasized that neither the proposed 
graph shown in Figure 1 nor the transformation to be discussed presently 
is to be considered as hypotheses proposed for a development of a theory. 
As such hypotheses they are definitely not good. They are introduced here 
only as illustrations of the basic idea. Even when later on in this paper 
we discuss some biological implications of the suggested transformation, 
we do so only to show what kind of biological implications may follow 
from a future properly selected transformation. It is more than likely 
that an abstract study of different types of transformations hitherto not 
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studied in topology will have to precede any further development of a 
biologically applicable theory. 

The feeding mechanism begins with the contact of the organism with 
food, which we represent by the point C in Figure 1. This is followed by 
the ingestion of food, usually into the food vacuole, which process is repre- 
sented by the point 7, the arrow C — J indicating that J follows C. The 
ingested food is digested, digestion D following J. This is followed by the 


DF 


Fie. 1 


absorption A of the digested food into the protoplasm of the cell, as well 
as by the rejection of indigestible waste, or defecation D;. The absorbed 
food is transported to various parts of the cell where a synthesis S; of the 
body of cell follows. These general synthetic processes result among other 
things in the synthesis D,, of digestive enzymes. This is followed by 
the secretion Su, of the digestive enzymes into the digestive vacuole 
where they come into contact with the ingested food, and digestion D 
results. The directed path JDAS,D..Sa-D, which contains the cycle 
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DAS~D .sSaeD, expresses a very fundamental property of every organism: 
in order that food may be assimilated and the body of the organism syn- 
thetized, the organism must be already present. The convergence of the 
arrows [> D and Sz.-—> D upon the point D expresses that digestion 
cannot take place without previous ingestion of food, nor can it take place 
without digestive enzymes, which presuppose the existence of the organ- 
ism producing them. Similarly, the synthesis S, needs both a previous ab- 
sorption 4, and a transport of the absorbed substance through the pro- 
toplasm of the cell. This inner transport J; in a protozoan usually takes 
place by diffusion. However, protoplasmic streamings or other mecha- 
nisms may conceivably be possible. The above is expressed by the con- 
vergence on S; of the arrows J; > S, and A — S;. The process S; is also 
followed by catabolic processes C;. At this stage the intake J,, of oxygen 
(for aerobic organisms) combines with C; to produce on one hand a release 
of energy £, on the other production P,,, of carbon dioxide, which is fol- 
lowed by the elimination £.., of carbon dioxide, as well as the production 
Py and the elimination W of other catabolic waste products. Some of the 
released energy is necessary to maintain the inner transport /,. If the 
latter involves any special movements, like protoplasmic streamings, 
then £ affects first that movement 7,,,, and the latter results in J;, as 
indicated by the arrows. In the light of present biochemical knowledge 
(Sheer, 1948, p. 32) we should not regard the liberation of energy as di- 
rectly connected with oxygen consumption, and, therefore, we do not con- 
nect [,, with £ on the graph. This in some cases may be a debatable ques- 
tion, which is, however, not essential for our purposes. The graph on 
Figure 1 is rather tentative anyway, and used largely as an illustration. 

A stimulus of some kind coming from the food results in the process of 
stimulation F,. How this process acts in a protozoan is little known. In 
some ways it affects the energy liberation Z£, and through this it produces 
certain movements M,; (cilia, flagella, protrusion of pseudopods), which 
bring the organism in contact C with food. This contact with food again 
acts as a stimulus, which in a similar way produces some movements M; 
which result in food ingestion. A harmful stimulus S;, results in a similar 
manner in a motion M,, the avoiding movement. It is very likely that 
M;, and M, are in some way connected through a mechanism which pro- 
duces a reciprocal inhibition of the two functions. We shall not, however, 
go into this detail. 

Last but not least the body synthesis S:, as well as the release E of 
energy, result eventually in the more or less complicated processes R of 


reproduction. 
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Thus constructed, the graph of Figure 1 strikes us with its complexity. 
Yet the true graph of a protozoan is undoubtedly more complex. We have 
not one digestive enzyme but many, produced through different processes, 
which should be represented by separate points. There is a sensitivity for 
stimuli of different modalities which again complicates the graph. The 
synthesis S, is not a single process, but a series of separate processes. 

The determination of the graph of a unicellular organism is ultimately 
a matter of experimental study. There is, however, a large proportion of 
aprioriness in our construction, which is due to the generally accepted 
definition of an organism. For some non-motile bacteria, for example, we 
may perhaps omit the points F,, M;, and M;. But the chemotropic ac- 
cumulation of some bacteria in regions of higher oxygen concentration is 
of the nature of a stimulus-and-response phenomenon. We hardly would 
speak of something as a living organism if it did not show something es- 
sentially similar to the cycle DAS,D..SaD. 

If we wish to consider, as some do, that the viruses are living uni- 
molecular parasitic organisms, we can do so mainly because we find for 
them a fundamentally similar graph. The attachment of the virus to the 
host cell (Luria, 1953; Pollard, 1953) is due to electrostatic forces. The 
electrostatic attraction plays the role of the stimulus F, and results in a 
movement M; of the virus toward the host, establishing the contact C. 
From here on little is known as to what happens in detail. But the lysis of 
the host is undoubtedly due either to enzymes liberated by the virus mole- 
cule, that is, by parts of molecules that are split off from the virus, or else 
the virus itself. Certain parts of its surface probably act as lytic enzymes. 
In either case the enzymatic action is due to previous synthetic activity S, 
which produced the virus molecule in question. This synthetic activity 
must involve intramolecular displacements and deformations, in which 
the digested food (results of lysis) which must “enter” (point A) the mole- 
cule, participates. Thus we have fundamentally the sequence CDAS;D..D, 
which is obtained from the graph of Figure 1 merely by omitting the 
points J and Sz. But if a penetration of a constituent part into the virus 
molecule occurs, then the point J may also be incorporated. Whether 
through oxidation or otherwise there must be energy liberation by the 
virus molecule in the course of its life cycle and this implies catabolic 
processes. This leads to a part of the graph involving the points Cz, E, W, 
and possibly others. J. von Neumann (1951) in his discussion of self- 
reproducing molecules points out that such a molecule must send out 
information-carrying units into the surrounding medium to pick out of a 
mixture of potential building units the proper ones. Add to this that the 
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building units must be first obtained by breaking down in a proper way 
larger units, which are actually given as “food,” and we formally come 
again to a cycle similar to DAS;D.Sa-D. 

We may now state somewhat more precisely the principle of biotopolog- 
ical mapping: there exists one, or very few, primordial organisms, charac- 
terized by their graphs; the graphs of all other organisms are obtained 
from this primordial graph or graphs by a transformation, which contains 
one or more parameters. Different organisms correspond to the different 
values of those parameters. 

Within this principle a theory of organisms can be developed by study- 
ing different topologically interesting transformations and their proper- 
ties and seeing which of them leads to the best agreement with observa- 
tion. Such was fundamentally the procedure used in the development of 
Newtonian mechanics. Such is also fundamentally the procedure used in 
Einstein’s general relativity theory. Newton’s basic principles, his laws of 
motion, are too general to say anything in a specific situation, until the 
forces are specified as functions of the coordinates, and until all forces 
acting on a given body are specified. A failure of agreement between ob- 
servation and theory, as in the case of the residual motion of Uranus, is 
not laid down to the failure of the basic principle, 

dx 
m= Pe 
but to the inadequate assumption about F. An additional force, due to an 
unknown planet, was to blame for the discrepancy. 

The basic concepts of the general relativity theory, namely, that our 
four-dimensional space-time is non-Euclidian, that its geometry is de- 
termined by the distribution of masses, and that the world-lines of par- 
ticles are geodetics in this non-Euclidian hyperspace cannot lead, impor- 
tant as they are, to any prediction or description of physical facts, until 
equations are established which determine the metric tensor as a function 
of mass distribution. In establishing these equations, a hypothetical ele- 
ment enters. The equations originally proposed by Einstein in 1916 were 
soon modified by him so as to include the cosmological term. Since H. 
Weyl’s attempt to generalize the space-time metric so as to incorporate 
electromagnetic phenomena, numerous modifications have been proposed. 
The question is still unsettled, though the latest generalization proposed 
by Einstein (1952) himself carries great promise. gr 

Our problem is, therefore, to establish within the principle of bio- 
topological mapping proper hypotheses as to the structure of the primor- 
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dial graph and as to the transformation law. In regard to the first we have 
already suggested in Figure 1 a possibility which is at present as good as 
any. We shall return to this question later in this paper. In regard to the 
second part of the problem, observation provides us with a clue. 

Before, however, proceeding with the problem proper we must point 
out that, as many other transformations in topology, the transformation 
looked for is not necessarily to be given in form of an analytic function. A 
well known uniparametric transformation in topology is, for example, the 
one that leads from a given complex to its m-fold covering complex. The 
multiplicity 2 of coverage plays the role of a (discontinuous) parameter. 
But this transformation is not prescribed in form of an analytic function, 
but rather as a set of rules which permit to construct from a given com- 
plex its covering complex, if for the desired coverage we know the sub- 
group of the fundamental group of the original complex (Seifert and Threl- 
fall, 1934, p. 189-91). We shall here also specify our transformation as a 
set of rules which permits the construction in a unique way of more com- 
plicated graphs from the primordial graph. 

Consider a colony of m identical cells, thus all having the same graph. 
The graph of the colony is simply the sum of m identical but distinct 
graphs, and symbolically we may represent, without loss of generality, 
the graph of the colony by a single graph of any one cell. 

The growth of organisms in complexity proceeds through differentiation 
of tissues and through their specialization. A biological function* fi, 
which originally is possessed by all cells, is lost by some of them and re- 
tained by others. These other cells are said to have specialized in the per- 
formance of the function f;. At the same time, as a rule, those cells which 
lost the biological function f; specialize in another biological function f;, 
which in its turn becomes lost in those cells which specialize in f;. A “divi- 
sion of biological functions” is thus established, and instead of » identical 
cells, represented by the original graph, we now have two classes of cells, 
the cells of one class differing from those of the other. The problem is to 
derive the graph of this organism from the graph of the original homoge- 
neous colony. 

We construct first two identical distinct original graphs, i and k (Fig- 
ure 2a). Since we are dealing now with the transformation which should 
be applicable to amy primordial graph, the graphs i and & in Figure 2 are 
drawn without any reference to the hypothetical graph of Figure 1. They 
are just any arbitrary graphs. Let the parts of the original homogeneous 


: To avoid confusion we shall use the word “function”? when referring to a mathematical 
function. “Biological function” will be used in the sense adopted in biology. 
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colony which correspond to graph 7 specialize in fi, those of the graph k 
in fx. In line with the biological facts of specialization, we remove the 
point f, from graph z and f; from graph & (Figure 2b). We shall denote 
the point f; belonging to graph 7 by f{"’, that belonging to graph & by f{", 
with similar notations for other points. Since & lost the biological func- 
tion f\"”, therefore, this function must be taken over by the function f{” 
of 7. In addition to the connection of f‘") in 7 to the point f\” and f< we 
connect f‘" to the points f(” and f\ in the same manner, that is, since 
the arrow in 7 is in the direction f() — f\”, the arrow of the new connec- 
tion is f\ > f?. Similarly we have f{ +f. In the same manner 
f< is connected to f{") and f{, and we obtain the graph shown in Fig- 
ure 2b. 


Fic. 2 
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This, however, does not describe the biological facts in all their gen- 
erality. When a group of cells of an organism specializes with respect to a 
given biological function, the performance of this biological function usual- 
ly becomes more complicated. The biological function breaks up in several 
subsidiary ones. Thus the specialized eye of a higher animal is much more 
complicated than the sensitive spot of the protoplasm of an amoeba 
on which it may be mapped. The number of separate biological functions 
involved in the process of vision of a higher animal is greater than that 
involved in the process of light sensitivity of an amoeba. 

The simplest assumption which we can make about this increase in 
number of subsidiary biological functions is that to each lost biological 
function the cell or tissue acquires one function subsidiary to that in 
which it specializes. Actually the situation may be more complicated, and 
to each lost biological function there may be # subsidiary specialized 
added. Or to qg lost biological functions a number w(qg) subsidiary special- 
ized may be added, where w(q) is a mathematical function of g. Since pres- 
ent day knowledge does not give us any reasons to choose between these 
and other possible assumptions, we shall make here the simplest one. In 
some cases it is likely to be a sufficiently good approximation to actual 
facts. 

On the basis of this assumption, in the case of Figure 2, the complex 7 
receives an additional subsidiary point f{”, while & receives a point f{. 
These we connect to the other points of the graph in the same manner as 
f{? and f{” respectively. Moreover, we assume that f{*) connects with 
f\? through f{? > f{?, and f{% connects with f{® through ff” = fi. 
This latter assumption is again one of the simplest ones, which we adopt 
provisionally. It may perhaps be more likely to assume two way connec- 


tions, such as 
(k) (x) 
Sr ee fr, ’ 


which forms a diangle. But restricting ourselves to the simplest assump- 
tion we find the graph of Figure 2c. 

The latter graph corresponds to an organism in which two biological 
functions are specialized and enhanced. The biological functions f‘ and 
F{P are seen to “belong” to the original graph i, those f and fi? to the 
graph k. Actually, specialization involves not two biological functions, 
but very nearly all of them. Some biological functions, however, notably 
those connected with synthesis of specific cell proteins and with energy 
yielding oxidative processes are not specialized. Thus those tissues which 
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lost the ability to digest food receive digested products from the special- 
ized tissues. But we do not know of a tissue which would have lost the 
ability for its own synthesis, and would receive ready synthesized product 
from another tissue. Nor do we know of a tissue which would have lost 
the ability of performing energy yielding oxidative processes, and would 
receive the energy needed for the maintenance of its life processes from 
other tissues. Thus in the primordial graph there are certain points, which 
correspond to the biological functions that are never specialized and 
which, therefore, do not undergo the transformation shown in Figure 2. 
We shall call those points, which remain unspecialized in a given transfor- 
mation, residual points, and the partial graph composed of them and their 
connecting arrows the residual graph. We shall also speak of residual bio- 
logical functions. 

Mathematically a residual graph may not be a connected one, or it may 
even degenerate into a set of disconnected points, that is, into a sum of 0- 
simplexes. This will happen, for example, if in a square we specialize the 
two diagonally opposite points. Biologically this is not likely to occur. 
Therefore in what follows we assume that the residual graph is connected. 

The subsidiary specialized biological functions are each performed by 
special cells or tissues, which retain their own residual biological functions. 
Therefore, in the case of Figure 2, the points f{") and f{? must be each con- 
nected to a residual graph, which in this case is composed of fi, fm, and fn. 
Thus finally we obtain the graph of Figure 2d, in which the residual graphs 
are designated by 7’ and &’. 

From the very simple graph of Figure 2a we arrived by a transforma- 
tion, which in essence describes the process of cell and tissue specializa- 
tion, to the very complicated graph 2d. True enough, we made a number 
of assumptions, but they were the simplest compatible with observations 
and were largely generalizations drawn from those observations. The re- 
sults become even more complicated when we generalize the transforma- 
tion so as to describe the specialization of m biological functions by m tis- 
sues. The situation is further complicated by the fact that we must con- 
sider the specialization of a class of cells not in one function, but in a group 
of biological functions. Thus in the simplest metazoa we notice the special- 
ization of the ectoderm and entomesoderm in the whole group of animal, 
respectively vegetative functions. Following the same argument as be- 
fore, the transformation rules can be stated thus: 

Let biological functions f, all arbitrarily numbered from 1 to , be- 
come specialized in m groups of 7, M2, . . « Mm points each. We have =n, = 
n. Let group i specialize in 1; biological functions f;, where / designates one 
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of the ; points, and lose n — m; biological functions fr, where k designates 

one of the 2 — m; points. The 2 — n; biological functions f;, are divided 

into m — 1 groups, each consisting of m, M2, . . . Mia, Mitt, - + + Mm points 

respectively. We now formulate the following transformation tules T: 

T,. Draw m distinct primordial graphs. The number m is a parameter of 
the transformation. 

T>. Remove from a graph i (i = 1, 2,...m) those » — n; points which 
correspond to its lost biological functions. 

T3. Connect each specialized point f{"’ of a graph 7 to those remaining 
points f of each graph k (k = 1,2,...i—1,i+1,...m) which 
correspond to those points f{ of graph 7 to which f{” is connected, 
preserving the directions of connections. 

T,. Choose a distribution of x — 1; elements into ; sets, so that the /th 
set has a number v‘” of elements with 


ny 


Sy? =n 1. (1) 


l=1 


Some of the v$”’s may be zeros. This will always happen when 2; > 
n — n;. This arbitrary chosen distribution plays the role of another 
parameter of the transformation. A third parameter is the choice of 
the 1,’s. 

T;. Inagraph7z add — n; subsidiary points, and divide them into ; sets 
containing each »‘") points. Connect each of the v$” points of a set to 
all the other non-residual points in the same way as the specialized 
point f{” is connected to them, preserving the directions of connec- 
tions and also make the connections 

i fiead en ert fi) (2) 

T;. Connect each point f}'), (r = 1, 2,...»{) to a residual graph in 
the same manner as f{") is connected to its own residual graph. 

The complexity of the graph thus obtained seems to defy any visualiza- 
tion. But so does the complexity of a higher living organism. In fact, the 
circumstance that a set of rather simple transformation rules applied to 
a relatively simple original graph leads to such a complicated result indi- 
cates in our opinion that we are on the right track. 

Let us now see how a graph thus obtained can be mapped on the original 
graph. 

The transformed graph contains the m original residual graphs, accord- 
ing to 7). In addition, each of the specialized points gives rise to v; sub- 
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sidiary points. Each of the m primordial graphs receives  — n; subsidiary 
points, according to 7;. The total number of subsidiary points is, there- 
fore, 


> (n—n:;) =n(m—1). (3) 


Each of those, according to 7,, is connected to its own residual graph. 
Thus there are altogether m + m(m — 1) residual graphs. Since they are 
all identical and each is a partial graph of the primordial, therefore, all 
corresponding points and lines may be mapped on the corresponding 
points and lines of the partial graph of the primordial, in an m + n(m — 1) 
to one mapping. 

Of the non-residual points, each f{") has given rise to »; + 1 points f!, 
fi) . ..f{),. Each of these can be mapped on the point f; of the primor- 
dial graph, and because of T; every connection f‘*) — f can be mapped 
on the connection f; > f, of the primordial graph. The connections (2) 
are all mapped on the zero connection of a point to itself. In other words, 
all lines (2) are mapped onto one point. 

To give a measure of the complexity of the transformed graph, we shall 
derive now an expression for the connection number or index (KGnig, 
1936) of the transformed graph. 

Let the primordial graph have a) points and aq, lines. Then its index 
is given b 

= : B=a,;—ao+1. (4) 

Consider a transformed graph, obtained by specializing n biological 
functions in m groups. 

When, according to 7;, we draw m distinct primordial graphs, we have 
ma, points. For each point removed from a graph we add one point, ac- 
cording to T; and T;. This procedure leaves a) unchanged. But at the end 
of the construction, according to 7, and (3), we add n(m — 1) additional 
residual graphs, each consisting of a) — » points. Altogether we add to 
the ma, points another n(m — 1)(a) — ). Altogether the transformed 
graph has 
a, = man tn(m—1) (ao—n) (5) 
points. ; 

Now consider the number of lines. The m primordial graphs contain 
altogether ma, lines. When a point fu is specialized to graph 7, it is removed 
from each of the m —- 1 primordial graphs, except the 7th, and fi? is con- 
nected to each of the m — 1 graphs according to 73. This is the same as 


fusing all points f{” (J = 1, 2, . . . m) into one. If the partial graph, com- 
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posed of the specialized points and their connecting lines, contains a, 
such lines, then the operation 7; results in the fusion of m corresponding 
lines into one. Therefore from ma, lines originally contained in the m pri- 
mordial graphs only a, remain. Thus operations T, and T; result in the 
reduction of the total number ma, of lines by 


(m—1)a,. (6) 


By a homeomorph deformation of the graph obtained by operations T» 
and 73, we can “‘pull in” all the specialized ” points into any one of the 
original primordial graphs. The remaining m — 1 graphs each become a 
residual graph now. Thus the result of operations T, and T3 may be viewed 
as giving one complete primordial graph, each of the m specialized points 
of which is connected with m — 1 residual graphs in the same fashion as 
it is connected with its own residual graph. 

Consider a specialized point f{” originally of degree s, and let it be con- 
nected with 6, other specialized points, so that s, > 6,. Within the com- 
plete graph it is connected to s, points. But to each of the remaining m — 1 
residual graphs it is connected only by s, — 8. lines. Altogether it is now 
of degree s, + (m — 1)(su — Bu) = ms. — (m — 1)B,. After completing 
operations T, and 7;, such a point receives vy") subsidiary points. Each 
of them is connected to other points in the same way as f\””. Therefore 
an additional v{"[ms,, — (m — 1)8.] lines are added. The connection (2) 
for f( add another v{* lines, or altogether 


va” Lm su— (m—1) Bul +o =” [ms +1—(m—1)B) (7) 


additional lines. 

Moreover each of the v{") subsidiary points f( (p = 1,2,..... y) is 
also connected, according to 7s, to an additional residual graph by the 
same number of lines as f\"”, that is, by s, — 8... The number of additional 
lines due to the connection of the v{” subsidiary points to the additional 
residual graphs is therefore 


ye? (Se = Bu) . (8) 


Thus each specialized point f{") of degree s, gives rise to a number of 
additional lines, within the set of the original primordial graphs, equal to 
the sum of (7) and (8), or to: 


ns” [(m+1) sut1— mB). 


Different points f{” have different values of s,, and », the latter being 
prescribed by the distribution in T,. Hence the total number of additional 
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lines due to the » specialized points is obtained by taking the sum of the 
above expression over all the 7 points, or, in other words, over all values of 
u and k. Thus we find the expression 


DS Sx” [(m+41) w+1— ms) . (9) 
k u 


Finally we must add the lines contained in the m(m — 1) additional 
residual graphs. A residual graph is obtained by removing from a primor- 
dial graph the m specialized points. With each such point of degree s, we 
remove S, lines. Thus, denoting by >> s, the sum of all degrees with their 


multiplicities, we would remove 2s, lines, if none of the m points were 
adjacent. If, however, we have a, pairs of adjacent points, connected by 
a, lines, then the a, lines are counted twice in the sum De Su, and therefc re 


the actual number of lines removed is 2s, — a;. Since the primordial 
graph contains a, lines, a residual graph contains a, — Ds, + a, lines, 
and the (m — 1) additional residual graphs contribute 


n(m—1) (a;—ZSy+ ae) (10) 
additional lines. 

Altogether the operation T results in a subtraction from the ma, origi- 
nal lines of (# — 1)a, lines [cf. (6)] and an addition of the numbers of 
lines given by (9) and (10). Therefore the index p’ = a; — a, + 1 of the 
transformed graph is given, because of (5), (6), (9) and (10), by 


pb’ = [m+n(m—1)] p—n(m—1) (s,.—a.) — (m—I1)a, 
+ (m—1) (n®—n—1) +) Syn? (mt 1) set — mp). OD 
k u 


Thus yu’ depends on m and » as well as on the distribution of degrees 
among different points of the primordial graph. Indirectly, through »{”, 
it does depend on the numbers 7. 

For the graph of Figure 2 we have m =n = 2; s, = 2 (u = 4, k); 
y) = py) = 1; p= 2; a, =0; Be = Bi = 0. Equation (11) gives for 
the transformed graph 2d the value y’ = 15. This can be verified by in- 
spection. 


We now shall prove 
Theorem 1. The transformation rules T, when applied to two different 


primordial graphs, result, for the same choice of the parameters, in dif- 


ferent graphs. 
We define two graphs as different if either one of them contains at least 
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one connection which is not contained in the other. Connections a — 6 
and 6 — a are considered as different. 

Consider two different primordial graphs, J and JJ. Let fifo be a connec- 
tion of J, not contained in JT. Suppose fife belongs to the residual partial 
‘graph. Since the transformed graph contains m+ n(m — 1) residual 
graphs, therefore, there will be m-+ »(m — 1) connections in the trans- 
formed graph J which are not contained in the transformed graph J. 
The two will be different. Let now fif2 connect in J either two specialized 
points or a specialized point with a point of the residual graph. In the 
transformed graph of J there will be a connection between the points that 
are mapped on f, and fo correspondingly, according to 7;. But in the trans- 
formed graph JT there will be no such connections. Hence the two graphs 
are different. 

Theorem 1 is important because it demonstrates the verifiability and 
predictive value of the theory. From that theorem there follows as a corol- 
lary that it is possible to determine in principle the original primordial 
graph, if a given transformed graph and the transformation rules are 
known. This possibility in principle is, however, of no practical value, be- 
cause of the tremendous complexity of the graphs of the higher organ- 
isms. The following theorem 2 is of more practical value. 

Theorem 2. The transformation rules T transform any partial graph of 
the primordial in such a way that points of the transformed graph which 
correspond to the points of the original are connected in the same manner 
as those of the original. 

The theorem is again a consequence of 7;. 

Because of that theorem, we may choose an experimentally convenient 
small number of biological functions in an organism, and compare the par- 
tial graph to the corresponding partial of the primordial. 

In a manner similar to that used in deriving expression (11), we may 
study other topological properties of the transformed graphs, which may 
have biological significance. Expression (11) gives us the total number of 
closed circuits in the transformed graph. A closed circuit of the type shown 
in Figure 3a is of a different biological significance from one shown in 
Figure 3b. As remarked above (p. 327), a circular circuit, as one in Fig- 
ure 3a, implies fundamentally the self-perpetuation of a set of functions, 
and the impossibility of perpetuation when any member of the set is 
missing. In the circuit of Figure 3b the convergence of arrows on f,; may 
mean either that the biological function f, can be performed only if fy or fs 
are performed, or if either of them is. In the second instance the circuit 
implies an extra stability, a coefficient of safety. One of the next steps in 
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the development of the theory of bio-topologic mappings is to study how 
the transformation T affects circuits of different types. The problem of 
point-bases and line-bases (Konig, 1936) and their transformation may 
also have a definite biological significance. 

It must be kept in mind that the transformation T does not determine 
the physico-chemical nature of the subsidiary points. Thus the secretion 
of an inactive substance may give rise in the process of the specialization 
to the secretion of venom by a snake, or to the secretion of odorous sub- 
stances, as in the case of a skunk. The specialization of some metabolic 
processes may lead to a subsidiary point which represents a chemolumines- 
cent reaction in some animals, though in others the subsidiary point may 
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represent a non-luminescent reaction. The biologist determines the biologi- 
cal homology of some apparently diverse functions from comparative 
anatomical and physiological studies. The studies of the organism’s graph 
together with the knowledge of the proper transformation will be of great 
help in those problems. 

The transformation JT does not, of course, tell the whole story. It may 
be applied in steps. First one or a few biological functions become special- 
ized, then others. Some general rule must now be looked for, which de- 
termines the sequence of different steps. Again we have some clues in ob- 
servation. The earliest specialization seems to be in the separation of the 
two groups of biological functions, the animal and vegetative. This sepa- 
ration is closely related with the formation of the two embryonic layers 
—the ectoderm and the entomesoderm, which are the only two layers in 
Coelenterata. A differentiation of the mesoderm begins with Platyhel- 
minthes and marks the next step. 

To illustrate what we mean by such a rule which determines the se- 
quence of steps, let us give here a hypotheticai example, which, most like- 
ly, has no meaning in reality. It is conceivable that at first a given group 
of biological functions in the “animal” partial graph begins to specialize, 
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and that the specialization of other functions comes the later, the greater 
their distance, measured in terms of minimum number of intermediate 
points, is from a given point. The starting point may be, for example, the 
center (or the two bicenters) of the tree, obtained by removal of certain 
specified lines of the primordial graph. 

One interesting consequence of the proposed T must be noted. While 
the number of possible transformed graphs is very large (we shall discuss 
it below), it is nevertheless finite. When, with the exception of the residual 
biological functions, all others are completely specialized, so that every 
tissue corresponds to only one specialized biological function, the process 
stops. 

The question is whether the application of T does stop here, or whether 
it can now be applied to this higher unit, perhaps with a different choice 
of parameters. In other words, we can now consider the final transformed 
graph as a primordial or as a starting point of the transformation T. Now 
the specializable biological functions may be different, but the transforma- 
tion rule remains. If this is so, we obtain in this way a set of graphs, which 
connect higher organisms in their interreaction, much in the same way as 
the cells are connected in an organism. In other words we construct graphs 
of social relations. In human society the division of labor and social dif- 
ferentiation are not connected with physiological and anatomical differen- 
tiation. This is rather the exception than the rule. In insect societies social 
and biological differentiations are sometimes observed together. 

Tentatively we suggest as extension of the principle of bio-topological 
mapping the following principle of socio-topological mapping: 

All possible relations between individuals and social groups are repre- 
sented by graphs which are transformations of the biological graph of the 
individual. The transformation is the same as in the bio-topological map- 
pings, except for a possibly different choice of the parameters. 

The two principles together give us a complete unity of the biological 
and social world. 

These considerations bring out, however, another aspect of the problem. 
We see that some primordial cell may be considered as a unit of first order, 
while a metazoan plays the role of a unit of second order, being in the same 
relation to a social aggregate as a cell is to the organism. We also noticed 
the high complexity of the graph of even a simple protozoan. Is it not pos- 
sible that the graph of a cell is already a transform of a graph of a living 
unit of smaller order, say, a living molecule? Evidence for or against such 
a possibility may be obtained by an experimental study of some partial 
graphs of a cell with the application of theorem 2. We have already re- 
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marked that certain points, for example, Si, Des, Cs in Figure 1 actually 
stand for several points each. If those several points are subsidiary points, 
then by theorem 2 each subsidiary point belonging to, say, S;, should be 
connected with each other point of the graph in the same way. Whether 
this is so can in principle be verified by physiological and biochemical ex- 
periments. If the experiments confirm the above, then on the basis of the 
principle of the bio-topological mapping we will have to conclude that 
there exists a lower primordial graph, perhaps that of a living “undifferen- 
tiated” molecule. Such a living molecule would perform the essential liv- 
ing functions (cf. p. 328). Then it may gradually differentiate and form a 
complicated aggregate of molecules, which all perform different functions. 

In connection with this possibility, one would naturally think of viruses. 
All evidence, however, indicates that viruses are derived from cells, rather 
than vice versa (Luria, 1953; Pollard, 1953). Also the parasitic nature of 
viruses precludes them from being considered as the primordial organisms. 
The idea of “naked genes,” as primordial organisms, has, however, been 
suggested. The big problem is why, if such primordial organisms exist, we 
do not find them now. In this connection we may point out that should a 
living virus-like molecule exist which “feeds” on dead organic matter or 
even on inorganic matter like an autotrophic plant, it may easily escape 
detection. Viruses were discovered by their effects. If putrefaction, for 
example, is affected by some submicroscopic organism, but if it is also 
usually the seat of bacteria, due to their universal presence, then, unless 
very precise quantitative measurements of all metabolic rates involved 
are made, nothing will suggest the presence of the unknown organisms. In 
any case, the study of partial graphs of some cells will contribute to this 
important question. 

The great variety of possible transforms is due to the many possibilities 
of choosing the number m in Tj, the v,’s in 74, and the 1,’s for a fixed m. 
Analytical expressions for the total number of possibilities shall be given 
elsewhere. The number of possibilities increases very rapidly with the 
number of specializable points of the primordial graph. Considering in 
Figure 1 the points S;, Cs, E, P.o,, A, and P,, as belonging to the residual 
partial graph, we have a total of 17 specializable points. Assuming, in line 
with the known fact, that the first specialization step divides the biological 
functions in two groups, vegetative and animal, we find the total number 
of possibilities of the order of 10°. ; 

The total number of animal species and plant species is of the order 
of 10°. The plant kingdom would be represented by such choice of parame- 
ters, which corresponds to the specialization of vegetative functions only. 
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The total number of theoretical possibilities is thus well in excess of the 
number of the known number of living species. This excess is further ac- 
centuated if we admit the possibility that species, or at least subspecies, 
differ not in their graphs, but in metric relations which give the intensities 
of the different biological functions. If this is so, then the total number of 
possibilities should give us the total number of genera, rather than of 
species. The number of species per genus varies very much. For example, 
there are 70 genera of Brachiopoda, with 225 species (Storrer, 1951), or 
about 3 species per genus. There are 6 genera and 30 species of Chaetogna- 
ta (Storrer, 1951). On the other hand, in the plant kingdom there are, for 
example, about 150 genera of Cyanophyta, containing 1400 species, or 
about 9 species per genus, and about 300 genera of Chrisophyta, contain- 
ing about 5,700 species, or 17 species per genus (Fuller and Tippo, 1949). 
Taking as a plausible average 10 species per genus, the total number of 
genera is of the order of 10°. 

However, the assumption that species within the same genus differ only 
metrically may not be true. Moreover, not only have there been a large 
number of extinct organisms, but there is no reason whatsoever to assume 
that all possible organisms have by now developed. In fact, new species do 
appear. Thus even 10’ possibilities may not be too large. 

On the other hand, it is possible that the transformation T is too gen- 
eral, and that a restricting condition may have to be added, which will re- 
duce the number of possibilities. The restriction may be of a purely biologi- 
cal nature, requiring, for example, that only biological functions ‘‘adja- 
cent” on the graph can be specialized together in the same tissue. Possibly, 
however, the restriction may be of a purely topological character. For ex- 
ample, of all the transforms only those may be admissible whose funda- 
mental group is homomorph to that of the primordial graph. Whether or 
not this is a restriction cannot be said without further study. 

One of the next steps should be a correlation of existing species with 
theoretically possible graphs, and prediction from the residual of theoreti- 
cal possibilities both of what species existed before, and what kind are to 
appear in the future. 

The theory of bio-topological mapping, as outlined above, deals only 
with the interrelations of biological functions. The knotty problem of or- 
ganic form is not touched by it. Yet topological relations in that problem 
are rather striking, as mentioned already (p. 322). Clearly a complete bio- 
topological theory needs an additional general principle to connect the 
topology of biological functions with the topology of form. 

It may be possible to use at this point to advantage the principle of 
maximum simplicity (Rashevsky, 1948) mentioned above. In view of the 
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vagueness of the term “simplicity,” it may be better to speak of the 
“principle of optimal design,” a terminology suggested by D. Cohn (1953) 
in his work on the structure of the vascular bed. Given the graph of the 
organization of biological functions, we may attempt to determine the 
size and shape of the organism by such considerations of optimal design. 
Whether the problem possesses unique solutions can be determined only 
by further studies. In view of the promising initial results, such an at- 
tempt seems to be worth while. 

In some cases the gross topology of a metazoan can very directly be 
related to its graph. Thus if ingestion J and defecation D, are not differen- 
tiated and specialized by different organs, but belong to the same organ, it 
implies the same organ of ingestion and defecation, that is, “two way 
traffic” (Buchsbaum, 1938) or one opening into the alimentary cavity, as 
in Coelenterata and Platyhelminthes. This makes the gross shape of the 
organism that of a topological sphere. When J and D; belong to different 
organs, this implies an entrance and an exit, “one way traffic,” and makes 
the gross shape a topological torus, as in the case of all animals beginning 
with and above the Nemertea, if we do not consider gill clefts and nasal 
passages, which actually make the form that of multiple torus. 

Another approach is more interesting mathematically, and may prove 
to be more general. It may be possible to find a topological principle 
which relates the biological function graph to a particular three-dimen- 
sional bounded topological complex. For two-dimensional pseudo-mani- 
folds such studies have been made in the relative graph theory (Kénig, 
1936, p. 196). Certain graphs can be drawn only on certain topological 
surfaces. It is, of course, a big question whether the topological description 
of an organism should be made in terms of the surfaces which bound its 
different parts, or in terms of three-dimensional topology. In either case 
present day topology may need further development. 

If we come to the “fine” topology of an animal, we face a still different 
problem. Consider the vascular bed of a higher animal. The surface of 
the blood vessels is of a very high Eulerian characteristic. But the exact 
value of the characteristic certainly varies from individual to individual. 
It may perhaps be interesting to develop a sort of “statistical topology,” 
dealing with surfaces of very high Eulerian characteristic, and establishing 
theorems which give not exact specifications of the different topological 
properties of the surface, but the probabilities of their occurrence when 
probabilities of other properties are given. The same may be done for 
graphs. A germ of such a theory may be contained in some of A. Rapo- 
port’s work on random nets (Rapoport, 1948). 

An important question in that connection is that of extremities. The 
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“gross” topology of the external shape remains the same, whether the ani- 
mal has none, two, or any other number of extremities. Yet their number 
is important both in determining some physiological performances of the 
animal as well as its shape (Rashevsky, 1948). While reserving the study 
of this problem for another paper, we would like to suggest the following 
approach, which is closely related to the basic idea of this paper. 

The more complicated an animal, the more complicated the movements 
which it performs. The various movements of an animal, including those 
of its extremities, body, larynx, etc. form a continuous group of rather 
complicated structure. Considering an animal as a “system of linked 
levers” (Rashevsky, 1948), we notice that the (linear) topology or struc- 
ture of the lever system is closely related to the structure of the group of 
motions which it can perform. Vice versa, the group of motions may well 
determine the arrangement and structure of the lever system. If the group 
of motions of a higher animal is homomorph to the group of motions of a 
protozoan, as the principle of bio-topological mapping would lead us to 
expect, and if we establish the rule of the homomorphic mapping of the 
two groups in its dependence on the transformation rule of the organismic 
graph, we shall not only have determined the number and shape of the 
extremities of the higher animals, but also, for example, the number of 
tentacles of a hydra or an octopus. The actual sizes of the different parts 
of the extremities would then be determined by metric considerations, 
so as to satisfy the quantitative requirements in the performance of cer- 
tain functions (Rashevsky, 1948). 

Finally we must touch briefly on the all important question: How is the 
principle of bio-topological mapping to be incorporated into the scheme of 
physical laws which govern the organic world? Two possibilities present 
themselves. 

The first is for the physicist to derive the possibility of a living molecule 
or simple small aggregate of molecules, proving its stability as a dynamic 
system and hence its ability of continued existence. Then it would be 
necessary to prove a general theorem to the effect that any physical sys- 
tem, derived from such a “primordial” by a transformation rule T or 
another related rule, has the same stability properties. This would reduce 
the principle of bio-topological mapping to already established physical 
principles. 

The second possibility envisages a much more radical approach. Al- 
though the question of reducing electromagnetic phenomena to some gen- 
eralized metric properties of the space-time world is not yet definitely 
solved, it is rather generally accepted that the physical phenomena are 
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manifestations of the metric properties of space-time. In physics, metric 
properties are of prime importance and are most striking. In biology, they 
are also very important, but purely relational or topological properties 
are at least as characteristic of life as the metric properties. It is, therefore, 
natural to think of the possibility that phenomena of life may be not the 
results of physical phenomena, but that both are expressions of the ge- 
ometric properties of the space-time universe, the latter being the expres- 
sion of metric properties, the former of the topological properties. 

The four-dimensional universe is considered as a topological hyper- 
sphere, with a rather regular metric, which shows some rare irregularities 
here and there, where matter is present. Is it not conceivable to also have 
small local topological irregularities, in which certain relational properties 
of world-lines of different particles appear, relational properties which are, 
in general, absent in most of the universe. Topology of multi-dimensional 
manifolds is still not too developed and properties of one- or two-dimen- 
sional manifolds cannot as a rule be generalized to higher dimensions. 
However, to illustrate what we have in mind we shall use a two-dimension- 
al model. Consider a large sphere to which several very small “ears” are 
attached in a limited region, so that actually the whole manifold is a mul- 
tiple torus. Let the dimension of the region be very small compared to the 
radius of the sphere. Any geodetics (world-line) on the sphere which does 
not pass through the region is zero homotop and behaves as if the whole 
structure were a topological sphere. Any two geodetics are homotop, if 
they do not pass through the region. Because of the smallness of the re- 
gion, the probability of a geodetic passing through it will be very small. 
But for geodetics passing through that region things are different. Even 
on a simple torus two geodetics are not necessarily homotop: Consider a 
system of geodetics such that no two are homotop. The system described 
by these geodetics will obey the laws of (two-dimensional) physics, since 
the world-lines of its particles are represented by geodetics. Yet it will 
show such relations between the world-lines of its particles which cannot 
be derived from the properties shown on the sphere, away from the region. 
Formally this does remind us of the properties of living systems: they 
manifest themselves through processes that are obeying all laws of phys- 
ics, yet they seem to possess relational properties which while not contra- 
dicting them, do not follow necessarily from those laws. 

In view of the fact that even the incorporation of electromagnetic 
phenomena into world geometry still perhaps defies the ingenuity of the 
greatest physicists, the difficulties of the above approach cannot be over- 
emphasized. This is still only a dream or vision, which may be fantastic. 
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But if ever this dream comes true, then pure mathematics, physics, biol- 
ogy, and with it the social sciences will be blended into an inseparable 


whole. 
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AWARDS FOR POST-DOCTORAL STUDY IN STATISTICS 
AT THE UNIVERSITY OF CHICAGO 


Three $4000 post-doctoral fellowships in Statistics are offered for 1955- 
56 by the University of Chicago. The purpose of these fellowships, which 
are open to holders of the doctor’s degree or its equivalent in research 
accomplishment, is to acquaint established research workers in the bio- 
logical, physical, and social sciences with the role of modern statistical 
analysis in the planning of experiments and other investigative programs, 
and in the analysis of empirical data. The development of the field of 
statistics has been so rapid that most current research falls far short of 
attainable standards, and these fellowships (which represent the fifth year 
of a five-year program supported by The Rockefeller Foundation) are in- 
tended to help reduce this lag by giving statistical training to scientists 
whose primary interests are in substantive fields rather than in statistics 
itself. The closing date for applications is February 15, 1955; instructions 
for applying may be obtained from the Committee on Statistics, Univer- 
sity of Chicago, Chicago 37. 
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